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I© Expression of factor VII and IX activities in mammalian cells. 



CM 

O© Methods are disclosed for producing proteins 
having biological activity for blood coagulation me- 

LUdiated by Factor Vila or Factor IX. The proteins are 
produced by mammalian host cells which have been 
stably transfected with a DNA construct containing a 



nucleotide sequence which codes at least partially 
for either Factor VII or Factor IX. The nucleotide 
sequence comprises a first nucleotide sequence en- 
coding a calcium binding domain, joined to a second 
nucleotide sequence positioned downstream of the 
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first sequence. In particular, the first nucleotide se- 
quence may be derived from a genomic clone or 
cDNA clone of Factor VIL The second sequence 
encodes a catalytic domain for the serine protease 
activity of either Factor VllA or Factor IX. The joined 
sequences code for proteins having substantially the 
same biological activity for blood coagulation as 
either Factor Vila or Factor DO 
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EXPRESSION OF FACTOR VII AND IX ACTIVITIES IN MAMMALIAN CELLS 



Technical Held 

The present invention relates to blood coagula- 
tion factors in general, and more specifically, to the 
expression of proteins having biological activity for 
blood coagulation. 

Background Art 

Blood coagulation is a process consisting of a 
complex interaction of various blood components 
or factors which eventually gives rise to a fibrin 
clot Generally, the blood components which par- 
ticipate in -what has been referred to as the coagu- 
lation "cascade" are proenzymes or zymogens, 
enzymatically inactive proteins which are converted 
to proteolytic enzymes by the action of an activa- 
tor, itself, an activated clotting factor. Coagulation 
factors which have undergone such a conversion 
are generally referred to as"activated factors," and 
are designated by the addition of a lower case 
postscript "a" (e.g., Vila). 

There are two separate systems which can 
promote blood clotting and thereby participate in 
normal haemostasis. These systems have been 
referred to as the intrinsic and the extrinsic coagu- 
lation pathways. The intrinsic pathway refers to 
those reactions which lead to thrombin formation 
through utilization of factors present only in plasma. 
An intermediate event in the intrinsic pathway is 
the activation of Factor IX to Factor IXa, a reaction 
catalyzed by Factor Xla and calcium ions. Factor 
IXa then participates in the activation of Factor X in 
the presence of Factor Villa, phospholipid and cal- 
cium ions. The extrinsic pathway involves plasma 
factors as well as components present in tissue 
extracts. Factor VII, one of the proenzymes re- 
ferred to above, participates in the extrinsic path- 
way of blood coagulation by converting (upon its 
activation to Vila) Factor X to Xa in the presence of 
tissue factor and calcium ions. Factor Xa in turn 
then converts prothrombin to thrombin in the pres- 
ence of Factor Va, calcium ions and phospholipid. 
Because the activation of Factor X to Factor Xa is 
an event shared by both the intrinsic and extrinsic 
pathways, Factor Vila can be used for the treat- 
ment of patients with deficiencies or inhibitors of 
Factor VIII (Thomas. U. S. Patent 4,382,083). There 
is also some evidence to suggest that Factor Vila 
may participate in the intrinsic pathway as well (Zur 
and Nemerson, J. Biol. Che'm.- 253 : 2203-2209, 
1978) by playing a role in the activation of Factor 
IX. 



Experimental analysis has revealed that human 
Factor VII is a single-chain glycoprotein with a 
molecular weight of approximately 50,000 daltons. 
In this form, the factor circulates in the blood as an 

5 inactive zymogen. Activation of Factor VII to Vila 
may be catalyzed by several different plasma prot- 
eases, such as Factor Xlla. Activation of Factor VII 
results in the formation of two polypeptide chains, 
a heavy chain (M r = 28,000) and a light chain (M r 

w = 17,000), held together by at least one disulfide 
bond. Factor VII may also be activated to Vila in 
vitro, for example, by the method disclosed by 
Thomas in U.S. Patent No. 4,456,591 . 

Factor IX circulates in the blood as a single- 

15 chain precursor of molecular weight 57,000 and is 
converted to an active serine protease (Factor IXa) 
upon cleavage by Factor Xla in the presence of 
Factor VIII. Factor IXa consists of a light chain and 
a heavy chain of molecular weights 16,000 and 

20 _ 29,000. respectively. 

Current treatment practices for patients haying 
coagulation disorders (e.g., deficiencies of Factor 
VIII and IX) generally involve replacement therapy 
with cryoprecipitate or other fractions -of human 

25 plasma containing enriched levels of a particular 
factor. These preparations have heretofore been 
obtained from pooled human plasma, although the 
preparation of cryoprecipitates requires the use of 
a relatively large amount of human plasma as start- 

30 ing material. 

Therapeutic uses of Factor VII exist in the 
treatment of individuals exhibiting a deficiency in 
Factor VII, as well as Factor VIII and Factor IX 
deficient populations, and individuals with Von Wil- 

35 lebrand's disease. More specifically, individuals re- 
ceiving Factors VIII and IX in replacement therapy 
frequently develop antibodies to these proteins. 
Continuing treatment is exceedingly difficult be- 
cause of the presence of these antibodies. Patients 

40 experiencing this problem are normally treated with 
an activated prothrombin complex known to consist 
of a mixture of active and inactive clotting en- 
zymes, including Factor Vila. Further, recent stud- 
ies indicate that small amounts (40-50 microg- 

45 rams) of injected Factor Vila are effective in con- 
trolling serious on-going bleeding episodes in Fac- 
tor VIII deficient patients who have high levels of 
antibody in their blood (Hedner and Kisiel, J. Clin. 
Invest. 71: 1836-1841, 1983). 

so Due to the diverse sources of the plasma used 
in the preparation of cryoprecipitates, it is difficult 
to test the preparations to ensure that they are free 
of viral contamination. For instance, essentially all 
recipients of cryoprecipitate show a positive test for 

55 
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hepatitis. Recent reports have also indicated that 
some hemophiliacs receiving cryoprecipitate have 
developed acquired immune deficiency syndrome - 
(AIDS). In addition, the purification of large amounts 
of these factors is extremely difficult and expen- 
sive. 

Consequently, there exists a need in the art for 
a method of producing relatively large quantities of 
pure preparations of Factors Vila and Factor DC. 
The present invention fulfills this need through the 
use of recombinant DNA technology, successfully 
eliminating the problem of viral contamination and, 
at the same time, providing a consistent and homo- 
genous source of active Factor Vila to treat Factor 
VII and Factor IX deficient patients and individuals 
with Von Willebrand's disease, as well as providing 
a source of purified Factor IX for use in replace- 
ment therapy. 

Disclosure of the Invention 

Briefly stated, the present invention discloses a 
DNA construct containing a nucleotide sequence 
which codes at least partially for Factor VII. The 
nucleotide sequence comprises a first nucleotide 
sequence encoding a calcium binding domain 
joined to a second nucleotide sequence positioned 
downstream of the first sequence. The second 
nucleotide sequence encodes a catalytic domain 
for the serine protease activity of Factor Vila. The 
joined sequences code for a protein which upon 
activiation has substantially the same biological ac- 
tivity for blood coagulation as Factor Vila The first 
nucleotide sequence may be substantially that of a 
gene encoding Factor Vll, Factor IX, Factor X, 
Protein C. prothrombin, or Protein S. Further, the 
first nucleotide sequence may also encode a leader 
peptide corresponding to the respective gene. 

In particular, the first nucleotide sequence may 
be derived from a genomic clone or cDNA clone of 
Factor VII, and may encode the leader peptide and 
amino-terminal portion of Factor VII. The first 
nucleotide sequence may also include a double- 
stranded oligonucleotide. A particularly preferred 
first nucleotide sequence is that encoding the lead- 
er peptide and amino-terminai portion of Factor IX 

In addition, the present invention discloses re- 
combinant plasmids capable of integration in mam- 
malian host cell DNA. One of the plasmids includes 
a promoter followed downstream by a set of RNA 
splice sites, the RNA splice sites being followed 
downstream by a nucleotide sequence which 
codes at least partially for Factor VII. The 
nucleotide sequence comprises a first nucleotide 
sequence which encodes a calcium binding domain 
joined to a second nucleotide sequence positioned 



downstream of the first sequence. The second 
nucleotide sequence encodes a catalytic domain 
for the serine protease activity of Factor Vila. The 
joined sequences code for a protein which upon 

5 activation has substantially the same biological ac- 
tivity for blood coagulation as Factor Vila. The 
nucleotide sequence is then followed downstream 
by a polyadenyiatidn signal. 

Similar to the recombinant plasmid noted 

10 above, the present invention also discloses a sec- • 
ond plasmid which includes a promoter followed 
downstream by a set of RNA splice sites, the RNA 
splice sites being followed downstrream by a 
nucleotide sequence which codes at least partially 

75 for Factor IX. The nucleotide sequence comprises 
a first nucleotide sequence which encodes a cal- 
cium binding domain joined to a second nucleotide 
sequence positioned downstream of the first se- 
quence. The second nucleotide sequence encodes 

20 a catalytic domain for the serine protease activity 
of Factor IX. The joined sequences code for a 
protein having substantially the same biological ac- 
tivity for blood coagulation as Factor IX The 
nucleotide sequence is then followed downstream 

25 by a polyadenylation signal. 

A third aspect of the invention discloses mam- 
malian cells stably transfected to produce a protein 
having susbtantially the same biological activity, 
upon activation, as Factor Vila. The cells are trans- 

30 fected with a DNA construct containing a 
nucleotide sequence which at least partially codes 
for Factor VII. The nucleotide sequence comprises 
a first nucleotide sequence which encodes a cal- 
cium binding domain joined to a second nucleotide 

35 sequence positioned downstream of the first se- 
quence. The second nucleotide sequence encodes 
a catalytic domain for the serine protease activity 
of Factor Vila The joined sequences code for a 
protein which, upon activation, has substantially the 

40 same biological activity for blood coagulation as 
Factor Vila 

An additional aspect of the invention discloses 
mammalian cells stably transfected to produce a 
protein having substantially the same biological ac- 

45 tivity as Factor IX The cells are transfected with a 
DNA construct containing a nucleotide sequence 
which codes at least partially for Factor DC The 
nucleotide sequence comprises a first nucleotide 
sequence which encodes a calcium binding domain 

so joined to a second nucleotide sequence positioned 
downstream of the first sequence. The second 
nucleotide sequence encodes a catalytic domain 
for the serine protease activity of Factor DC The 
joined sequences code for a protein having sub- 

55 stantially the same biological activity for blood co- 
agulation as Factor DC 
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The present invention further provides for a 
method of producing a protein having biological 
activity for blood coagulation mediated by Factor 
Vila through establishing a mammalian host ceil 
which contains a DNA construct containing a 
nucleotide sequence which codes at least partially 
for Factor VII. The nucleotide sequence comprises 
a first nucleotide sequence which encodes a cal- 
cium binding domain joined to a second nucleotide 
sequence positioned downstream of the first se- 
quence. The second sequence encodes a catalytic 
domain for the serine protease activity of Factor 
Vila. The joined sequences code for a protein 
which, upon activation, has substantially the same 
biological activity for blood coagulation as Factor 
Vila. Subsequently, the mammalian host is grown 
in an appropriate medium and the protein product 
encoded by the DNA construct and produced by 
the mammalian host cell is isolated. The protein 
product is then activated to generate Factor Vila. 

Still a further aspect of the present invention 
discloses a method of producing a protein having 
biological activity for blood coagulation mediated 
by Factor IX. The method comprises establishing a 
mammalian host cell which contains a DNA con- 
struct containing a nucleotide sequence which 
codes at least partially for Factor IX. The nucleo 
tide sequence comprises a first nucleotide se- 
quence which encodes a calcium binding domain 
joined to a second nucleotide sequence positioned 
downstream of the first sequence. The second 
nucleotide sequence encodes a catalytic domain 
for the serine protease activity of Factor IX. The 
joined sequences code for a protein having sub- 
stantially the same biological activity for blood co- 
agulation as Factor IX. The mammalian host cell is 
subsequently grown in an appropriate medium and 
the protein product encoded by the mammalian 
host cell is isolated. Protein products produced by 
the methods noted above are also disclosed. 

Yet another aspect of the present invention 
discloses a DNA construct comprising a DNA se- 
quence encoding Factor VII. In a preferred embodi- 
ment, the DNA sequence comprises the cDNA 
sequence of Figure 1b from bp 36 to bp 1433. In 
another preferred embodiment, the DNA sequence 
comprises the cDNA sequence of Figure 1b from 
bp 36 to bp 99, followed downstream by the se- 
quence from bp 166 to bp 1433. Recombinant 
plasmids capable of integration in mammalian host 
cell DNA comprising the DNA sequences de- 
scribed immediately above are also disclosed. 

Mammalian cells stably transfected with a re- 
combinant piasmid comprising a DNA sequence 
encoding Factor VII are also disclosed. In preferred 
embodiments, the DNA sequence comprises the 



cDNA sequence of Figure 1b from bp 36 to 
bp1433, or the cDNA sequence of Figure 1b, from 
bp 36 to bp 99, followed downstream by the se- 
quence from bp 166 to bp 1433. 

5 A method for producing a protein having bio- 

logical activity for blood coagulation mediated by 
Factor Vila through establishing a mammalian host 
ceil that contains a DNA construct as described 
above is also disclosed. The mammalian host cell 

70 is subsequently grown in an appropriate medium, 
and the protein product encoded by the DNA con- 
struct is isolated. The protein product is then ac- 
tivated to generate Factor Vila. 

Other aspects of the invention will become 

75 evident upon reference to the following detailed 
description and attached drawings. 

Brief Description of the Drawings 

20 

Figure 1a illustrates the partial Factor VII 
cDNA sequence produced by joining por- 
tions of cDNA eiones XVII2115 and WII1923. 

25 Figure 1b illustrates the Factor VII cDNA 

sequence of XVN2463. Arrows indicate the 
extent of the deletion, in the sequence of 
XVH565. Numbers above the sequence des- 
ignate amino acids. Numbers below des- 

30 ignate nucleotides. 

Figure 2a illustrates the amino acid se- 
quences of the amino terminal regions of 
several clotting factors. 

35 

Figure 2b illustrates a comparison of the 
amino acid sequence of Factor VII obtained 
from protein sequencing with that encoded 
by the cDNA. 

40 

Figure 3 illustrates the joining of Factor IX 
leader sequences to a sequence encoding a 
consensus calcium binding domain. 

45 Figure 4 illustrates the joining of the Factor 

IX-consensus sequence hybrids to a partial 
Factor VII cDNA to produce an in-frame cod- 
ing sequence. 

so Figure 5 illustrates the construction of a pias- 

mid containing a coding sequence for a Fac- 
tor IX/Factor VII fusion protein. 

Figure 6 illustrates the expression vector 
55 FIXA/ll/pD2. Symbols used are Ad2 MLP, the 

major late promoter from adenovirus 2; L1-3, 
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the adenovirus 2 tripartite leader sequence; 
5'ss, 5" splice site 1 3'ss, 3' splice site f and 
pA, the late polyadenylation signal from 
SV40- 

Rgure 7 illustrates the nucleotide sequence 
of a Factor IX/Factor VII cDNA fusion. 

Figure 8 Illustrates expression vector 
pM7135. Symbols used are E, the SV40 
enhancer; ori, the 0-1 map units Ad 5; pA, 
the early polyadenylation signal from SV40; 
A, the deletion region of the pBR322 
"poison" sequences; and other symbols as 
described for Rgure 6. 

Rgure 9 illustrates the subclonmg of the 
2463bp Factor VII cDNA. 

Rgure 10 illustrates the subcloning of the 
565bp Factor VII cDNA. 

Rgure 1 1 illustrates the joining of the 5 T end 
of pVII565 and the 3' portion of pVII2463in 
pUC18 to generate pVII2397. 

Rgure 12 illustrates the construction of the 
expression plasmids FVII(2463)/pDX and 
J=VII(565 + 2463)/pDX. pA denotes the 
polyadenylation signal from SV40 in early or 
late orientation, as described in Example 9. 
Other symbols are as described for Rgure 8. 

Best Mode for Carrying Out the Invention 

Prior to setting forth the invention, it may be 
helpful to an understanding thereof to set forth 
definitions of certain terms to be used hereinafter. 

Complementary DNA or cDNA: A DNA mol- 
ecule or sequence which has been enzymatically 
synthesized from the sequences present in a 
mRNA template. 

DNA Construct : A DNA molecule, or a clone of 
such a molecule, either singie-or double-stranded, 
which may be isolated in partial form from a natu- 
rally occurring gene or which has been modified to 
contain segments of DNA which are combined and 
juxtaposed in a manner which would not otherwise 
exist in nature. 

Plasmid or Vector A DNA construct containing 
genetic information which may provide for its repli- 
cation when inserted into a host cell. A plasmid 
generally contains at least one gene sequence to 



be expressed in the host cell, as well as sequences 
which facilitate such gene expression, including 
promoters and transcription initiation sites, it may 
be a linear or closed circular molecule. 

5 Joined : DNA sequences are said to be joined 

when the 5 r and 3' ends of one sequence are 
attached, by phosphodiester bonds, to the 3' and 5* 
ends, respectively, of an adjacent sequence. Join- 
ing may be achieved by such methods as ligation 

w of blunt or cohesive termini, by synthesis of joined 
sequences through cDNA cloning, or by removal of 
intervening sequences through a process of di- 
rected mutagenesis. 

Leader Peptide : An amino acid sequence 

is which occurs at the amino terminus of some pro- 
teins and is generally cleaved from the protein 
during subsequent processing and secretion. Lead- 
er peptides comprise sequences directing the pro- 
tein into the secretion pathway of the cell. As used 

20 herein, the term "leader peptide' may also mean a 
portion of the naturally occurring leader peptide. 

Domain: A three-dimensional, self-assembling 
array of specific amino acids in a protein molecule 
which contains all or part of the structural elements 

26 necessary for some biological activity of that pro- 
tein. 

Biological Activity : A function or set of func- 
tions performed by a molecule in a biological con- 
text (i.e. , in an organism or an in vitro facsimile). 

30 Biological activities of proteins may be divided into 
catalytic and effector activities. Catalytic activities 
of clotting factors generally involve the activation of 
other factors through the specific cleavage of pre- 
cursors. Effector activities include specific binding 

35 of the biologically active molecule to calcium or 
other small molecules, to macromolecules such as 
proteins, or to cells. Effector activity frequently 
augments, or is essential to, catalytic activity under 
physiological conditions. Catalytic and effector ac- 

40 tivities may, in some cases, reside within the same 
domain of a protein. 

For Factor Vila, biological activity is character- 
ized by the mediation of blood coagulation through 
the extrinsic pathway. Factor Vila activates Factor 

45 X to Factor Xa, which in turn converts prothrombin 
to thrombin, thereby initiating the formation of a 
fibrin clot. Because the activation of Factor X is 
common to both the extrinsic and intrinsic path- 
ways of blood coagulation, Factor Vila may be 

so used to treat Individuals severely deficient in the 
activities of Factor IX, Factor VIII or Von Wiltebrand 
Factor. 

The biological activity of Factor IX is character- 
ized by the mediation of blood coagulation through 
55 the intrinsic pathway. Factor IX is activated to Fac- 
tor IXa by Factor XIa. Factor IXa then activates 
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Factor X to Factor Xa in the presence of Factor 
Villa, phospholipid, and calcium ions. Factor Xa 
then acts in the conversion of prothrombin to 
thrombin, initiating the formation of a fibrin clot. 

As noted above, the isolation of Factor VII from 
human plasma is a time-consuming and expensive 
process since the factor is a rare protein present 
only at a concentration of approximately 300 micro- 
grams per liter of blood. In addition, it is difficult to 
separate from prothrombin, Factor IX and Factor X 
and is susceptible to proteolytic attack during pu- 
rification (Kisiel and McMullen, ibid). Although 
single-chain human Factor VII has been purified to 
homogeneity (Kisiel and McMullen, ibid), the pub- 
lished purification methods are generally limited by 
low yield and/or contamination by other coagulation 
factors. 

Factors VII and IX are produced in the liver and 
require vitamin K for their biosynthesis. Vitamin K 
is necessary for the formation of specific gamma- 
carboxyglutamic acid residues in the factors. These 
unusual amino acid residues, which are formed by 
a post-translational modification, bind to calcium 
ions and are responsible for the interaction of the 
protein with phospholipid vesicles. In addition, Fac- 
tors VII and IX each contain one 0-hydroxyaspartic 
acid residue which is also formed after the proteins 
have been translated. However, the role of this 
amino acid residue is not known. 

Given the fact that the activities of Factor VII 
and IX are dependent upon post-translational modi- 
fications involving the gamma carboxylation of spe- 
cific glutamic acid residues, and may also be de- 
pendent upon the hydroxylation of a specific aspar- 
tic acid residue, it is unlikely that an active product 
could be produced through the cloning and expres- 
sion of Factors VII" and IX in a microorganism. 

Accordingly, the present invention provides a 
method of producing a protein having biological 
activity for blood coagulation mediated by Factor 
Vila using stably transfected mammaiian cells. In 
addition, the present invention also provides a 
method of producing a protein having biological 
activity for blood coagulation mediated by Factor 
IX. 

As noted above. Factors VII and IX require 
vitamin K for their biosynthesis. In addition, the 
plasma proteins prothrombin, Factor X, Protein C, 
and Protein S also require vitamin K for their bio- 
synthesis. The amino-terminal portions of these 
proteins, which contain gamma-carboxyglutamic 
acid residues, are homologous in both amino acid 
sequence and in biological function (Figure 2a). 
Further, the carboxy-terminal portions of Factor VII, 
prothrombin, Factor IX. Factor X, and Protein C. 
determine their specific serine protease functions. 



Factor VII is a trace plasma protein, and the 
mRNA encoding Factor VII is believed to be rare. 
Consequently, purification of Factor VII from plas- 
ma in sufficient quantities to permit extensive se- 

5 quence analysis and characterization remains dif- 
ficult. Degradation of Factor VII during purification, 
even in the presence of protease inhibitors, was 
noted by Kisiel and McMullen (ibid). Due to these 
difficulties, Factor VII has been poorly character- 

70 ized. compared to . other more abundant compo- 
nents of the blood coagulation system. Indeed, the 
work of Kisiel and McMullen (ibid) yielded se- 
quence information for only 10 residues of each 
chain of Factor VII, and in each sequence the 

T5 identification of two residues was tentative. Partial 
amino acid sequence data for Bovine Factor VII 
have also been published (DiScipio et al., ibid). 

The presumed rarity of Factor VII- mRNA has 
contributed to the lack of knowledge of the Factor 

20 VII gene. The success of conventional cDNA clon- 
ing techniques is dependent on a sufficient quantity 
of mRNA for use as a template. Premature termina- 
tion of reverse transcription results in the produc- 
tion of cDNA clones lacking the 5* end and this 

2$ condition is exacerbated by low mRNA levels. Sev- 
eral strategies for cDNA cloning of low abundance 
message have been developed (Maniatis et al., 
Molecular Cloning: A Laboratory Manual . Cold 
Spring Harbor Laboratory, 1982), but a lack of 

30 knowledge of the amino acid sequence of the prod- 
uct of interest makes it impossible to predict the 
DNA sequence and to design appropriate 
oligonucleotide probes. While it may be relatively 
straightfoward to obtain a partial cDNA clone of a 

35 gene encoding a rare protein by using these ad- 
vanced strategies, full-length cDNA clones of 
genes encoding rare proteins such as Factor VII 
remain exceedingly difficult to obtain. 

In comparison to Factor VII, Factor IX is a 

40 relatively abundant protein and the sequence of a 
cDNA clone of the human Factor IX gene is known 
(Kurachi and Davie, Proc. Natl. Acad. Sci. USA 79: 
6461-6464, 1982; and Anson et al., EMBO 
1053-1060, 1984). The structure of the Factor IX 

45 gene has been characterized and the amino acid 
sequence of the protein has been determined on 
the basis of the known nucleotide sequence. Some 
protein sequence data have also been published 
for human and bovine Factor IX and the sequences 

so analyzed (DiScipio et al., ibid). The amino terminal 
portion of the protein contains 12 glutamic acid 
residues that are converted to 7-carboxyglutamic 
acid (Gla) residues in the mature protein. The 
cleavage sites involved in the activation of Factor 

55 IX have also been identified (Kurachi and Davie, 
ibid). A sequence at the 5' end of the Factor IX 
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cDNA clone codes for a signal peptide which is 
typical of those found in most secreted proteins - 
(Kurachi and Davie, ibid). The expression of the 
Factor IX gene through recombinant DNA methods 
has not been previously reported. 

Because of the difficulty in obtaining a full- 
length cDNA clone of the Factor VII gene, three 
novel approaches were adopted to supply the 5' 
end of the coding sequence, Including the region 
encoding the leader peptide. According to the first 
method, a partial cDNA clone for Factor VII is 
joined to a fragment encoding the leader peptide 
and 5' portion of Factor DC This approach is based 
on the observation that the amino-terminal portions 
of the two molecules are responsible for the cal- 
cium binding activities of the respective proteins 
and the discovery that the calcium binding activity 
of Factor IX can substitute for that of Factor VII. 
The resultant polypeptide retains the biological ac- 
tivity of authentic Factor VII because the specific 
serine protease activities of the coagulation factors 
reside in the carboxy-terminal regions of the mol- 
ecules. The second approach combines the partial 
cDNA clone with a DNA sequence encoding the 
leader and amino-terminal regions of Factor VII. 
The partial cDNA and amino acid sequences of 
Factor VII disclosed herein enable the screening of 
a genomic DNA library or cDNA library foe clones 
comprising the 5 f portion of the Factor VII gene. 
The third approach involves joining the partial 
cDNA clone to hybrid coding sequences compris- 
ing a cDNA fragment encoding the leader peptide 
of Factor DC and a synthetic gene segment encod- 
ing a consensus calcium binding domain or a pre- 
dicted amino terminal sequence for Factor VII. The 
coding sequence for the amino terminus of Factor 
VII was established through previously unpublished 
amino acid sequence data disclosed herein. The 
consensus sequence was derived from the factor 
VII data and published sequence data for other 
vitamin K-dependent plasma proteins. 

Consistent with the approach described above 
for screening for clones comprising the 5' portion 
of the Factor VII gene, the inventors have been 
successful in obtaining a full-length, correct cDNA 
that is suitable for expression. 

Among the cDNA clones that were generated,* 
a clone designated " XVII2463" contained the larg- 
est Factor VII cDNA insert. It was found to contain 
the entire coding sequence for Factor VII. This 
clone included a 35 nucleotide 5' untranslated re- 
gion, 180 nucleotides coding for a 60 amino acid 
leader, 1218 nucleotides coding for the 406 amino 
acid mature protein, a stop codon, 1026 
nucleotides of 3 ? untranslated sequence, and a 20 
base poly(A) tail (beginning at position 2463). This 
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cDNA has now been sequenced in its entirety on 
both strands. A comparison of it with two cDNA 
inserts isolated earlier from clones XVH2115 and 
XV1II923, revealed that XVH2463 contains, on a sin- 

5 gle EcoRI fragment a Factor VII cDMA coding for 
Factor VII leader and mature protein sequences. 

A second clone, XVII565, was isolated that con- 
tained a cDNA insert that was identical to the cDNA 
of done XVII2463 from nucleotide 9 to nucleotide 

70 638, except that it lacked nucleotides 100 to 165 - 
(Figure 1b). In comparing the cDNAs to Factor VII 
genomic DNA, the absent sequences correspond 
precisely to one exon-like region. Therefore, two 
Factor VII cDNAs have been obtained which ap- 

75 pear to reflect alternative mRNA splicing events. 

The leader encoded by XVH2463 is exception- 
ally long (60 amino acids) and has a very different 
hydrophobicfty profile when compared with Factor 
IX, protein C and prothrombin. This leader contains 

20 two mets, at positions -60 and -26. Initiation most 
likely beigns at the first met since a hydrophobic 
region, typical of signal peptides, follows the met at 
position -60, but not the met at -26. It is interesting 
that the absent sequence in XVU565, which cor- 

25 .responds precisely to an exon-like region in the 
genomic clone, results in a 38 amino add leader 
with a hydrophobicity pattern more analogous to 
Factor DC protein C, and prothrombin. 

Since it was not clear then which, if either, of 

30 the leaders described above was authentic, an ad- 
ditional approach was initiated in an effort to ana- 
lyze the 5' end sequence. Briefly, this approach 
included the construction and screening of a hu- 
man genomic DNA library, and the identification of 

35 genomic clones comprising Factor VII gene se- 
quences. The 5* portion of the genomic sequence 
was subseqently joined to the cDNA to construct a 
full-length clone. 

Jn an additional construct a 5* Factor VII cDNA 

40 fragment of XVI1565 containing all of the leader and 
29 amino adds of the mature coding sequence was 
ligated to a fragment of the cDNA of XVII2463 - 
(containing the remainder of the mature protein and 
^-untranslated sequences). This "565-2463" se- 

45 quence encodes a full-length Factor VII cDNA se- 
quence as a single EcoRI fragment. 

The DNA sequences described above are then 
inserted into a suitable expression vector which is 
in turn used to transfect a mammalian cell One. 

50 Expression vectors for use in carrying out the 
present invention will comprise a promoter capable 
of directing the transcription of a foreign gene in a 
transfected mammalian cell. Viral promoters are 
preferred due to their efficiency in directing tran- 

55 scription. A particulary preferred such promoter is 
the major late promoter from adenovirus 2. Such 
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expression vectors will also contain a set of RNA 
splice sites located downstream from the promoter 
and upstream from the insertion site for a gene 
encoding a protein having biological activity for 
blood coagulation. Preferred RNA splice site se- 
quences may be obtained from adenovirus and/or 
immunoglobulin genes. Also contained in the ex- 
pression vectors is a polyadenylation signal, lo- 
cated downstream of the insertion site. Viral 
polyadenylation signals are preferred, such as the 
early or late polyadenylation signals from SV40 or 
the polyadenylation signal from the adenovirus 5: 
Elb region. In a particularly preferred embodiment, 
the expression vector also comprises a viral leader 
sequence, such as the adenovirus 2 tripartite lead- 
er, located between the promoter and the RNA 
splice sites. Preferred vectors may also include 
enhancer sequences, such as the SV40 enhancer. 

Cloned DNA sequences may then be intro- 
duced into cultured mammalain cells by calcium 
phosphate mediated transfection. (Wigler et al., 
Cell 14: 725, 1978; Corsaro and Pearson, Somatic 
Cell Genetics 7: 603, 1981; Graham and Van der 
Eb. Virology 52: 456, 1973.) A precipitate is formed 
of the DNA and calcium phosphate and this 
precipitate is applied to the cells. A portion of the 
cells take up the DNA and maintain it inside the 
cell for several days. A small fraction of the cells - 
(typically 10 stably integrate the DNA into the 
genome. In order to identify these stable integrants, 
a gene ^hat confers a selectable phenotype (a 
selectable marker) is generally introduced along 
with the gene of interest Preferred selectable 
markers include genes that confer resistance to 
drugs, such as G-418 and methotrexate. Selectable 
markers may be introduced into the cell dn a 
separate plasmid at the same time as the gene of 
interest or they may be introduced on the same 
plasmid. A preferred selectable marker is the gene 
for resistance to the drug G-418, which is carried 
on the plasmid pKO-neo (Southern and Berg, J. 
Mol. AdoI. Genet? 1: 327-341, 1982). It may also be 
advantageous to add additional DNA, known as 
"carrier DNA," to the mixture which is introduced 
into the cells. After the cells have taken up the 
DNA, they are allowed to grow for a period of time,, 
typically 1-2 days, to begin expressing the gene of 
interest Drug selection is then applied to select for 
the growth of cells which are expressing the selec- 
table marker in a stable fashion. Clones of such 
cells may be screened for expression of the protein 
of interest. 

Factor VII and Factor IX produced byjhe trans- 
fected cells may be removed from the cell culture 
media by adsorption to barium citrate. Spent me- 
dium is mixed with sodium citrate and barium 



chloride and the precipitate col lected. The 
precipitated material may then be assayed for the 
presence of the appropriate clotting factor. Further 
purification may be achieved through immunoad- 

5 sorption. It is preferred that the immunoadsorption 
column comprise a high-specificity monoclonal 
antibody. Alternatively, purification of the barium 
citrate precipitated material may be accomplished 
by more conventional biochemical methods or by 

70 high-performance liquid chromatography. 

Conversion of single-chain Factor VII to active 
two-chain Factor Vila may be achieved using Fac- 
tor Xlla as described by Hedner and Kisiel (J. Clin. 
' Invest. 71; 1836-1841, 1983), or with other prot- 

75 eases having trypsin-like specificity (Kisiel and 
Fujikawa, Behrino Inst Mitt. 73: 29-42, 1983). 

In summary, the present invention provides a 
method for the production of proteins having the 
activity of vitamin K-dependent blood coagulation 

20 factors using transfected mammalian cells. Gene 
sequences encoding the specific serine protease 
domains of the coagulation factors are . isolated 
from cDNA libraries. Sequences encoding the lead- 
er peptides and calcium binding domains are iso- 

25 lated from cDNA or genomic libraries or construct- 
ed from synthesized oligonucleotides. The se- 
quences are then joined in an appropriate expres- 
sion vector so as to encode a protein having the 
desired biological activity for blood coagulation. 

30 The resulting vector and a plasmid containing a 
drug resistance marker are co-transfected into ap- 
propriate mammalian tissue culture cells. Transfec- 
ted cells may then be selected by addition of the 
appropriate drug, such as G-418. The protein pro- 

35 ducts are then purified from the cell growth media 
and assayed for biological activity in a blood co- 
agulation assay and for immunological cross-reac- 
tivity using antibodies prepared against authentic 
human clotting factors. 

40 To summarize the examples which follow. Ex- 
ample 1 discloses the cloning of a full-length cDNA 
sequence for Factor VII. Example 2 discloses a 
partial amino acid sequence of human Factor VII, 
including the sequence of approximately 30 amino 

45 acids at the amino terminus. Example 3 discloses 
the construction and screening of a human 
genomic DNA library and the identification of 
genomic clones comprising Factor VII gene se- 
quences. Example 4 discloses the construction of 

so two hybrid gene segments, each comprising a 
cDNA fragment encoding the leader peptide of 
Factor IX and a synthesized double-stranded frag- 
ment encoding a consensus calcium binding do- 
main. The hybrid sequences are then joined to 

55 partial cDNA clones of Factor VII. Using in vitro 
mutagenesis, the consensus sequence was then 
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altered to conform to the protein sequence data for 
Factor VII. Example 5 describes the construction of 
a gene sequence encoding a fusion protein com- 
prising the calcium binding domain of Factor IX 
and the specific serine protease domain of Factor 
V1L Example 6 describes the construction of the 
vector pD2 for use in expressing proteins having 
biological activity for blood coagulation in transfec- 
ted mammalian cells. The gene fusion described in 
Example 5 is expressed using this vector. Example 
7 describes the use of the vector pD2 to express a 
gene for Factor IX in a transfected mammalian cell 
line. Example 8 describes the construction of the 
vector pM7135, which contains DNA sequences 
encoding a primary translation product comprising 
the leader sequence of Factor IX fused to Factor 
VII. This vector may be used to produce a protein 
having the activity of Factor VII in a transfected 
mammalian cell line. Example 9 describes the ex- 
pression of Factor VII using cDNA sequences, and 
the expression of Factor VII from a genomic-cDNA 
hybrid sequence. 

The following examples are offered by way of 
illustration and not by way of limitation. 

EXAMPLES 

Restriction enzymes were obtained from Be- 
thesda Research Laboratories (BRL) and New Eng- 
land Biolabs and were used as directed by the 
manufacturer, unless otherwise noted. 
Oligonucleotides were synthesized on an Applied 
Biosystems Model 380 A DNA synthesizer and 
purified by polyacrylamide gel electrophoresis on 
denaturing gels. EL coli cells were transformed as 
described by Maniatis et al. (Molecular Clonino: A 
Laboratory Manual. Cold Spring Harbor Laboratory, 
1982). M13 and pUC cloning vectors and host 
strains were obtained from BRL Factor VII was 
prepared from human plasma as described by 
Kisiel and McMulien (ibid). 

Example 1: Cloning of a Partial Factor VII cDNA. 



A. Construction of a human liver cDNA library. 

A cDNA library was prepared from human liver 
mRNA by the method of Chandra et al., Proc. Natl. 
Acad. Sci. U.S.A. 80 : 1845-1848, T983. The cDNA 
preparation was sedimented through an alkaline 
sucrose gradient (Monahan et al., Biochemistry 15: 
223-233, 1976) and fractions containing species of 
greater than about 1000 nucleotides were pooled. 
The first strand preparation was made double- 
stranded using reverse transcriptase (Chandra et 



al., 1983), treated with S1 nuclease, and the resid- 
ual staggered ends fiiled-in using DNA Polymerase 
I (Klenow fragment) in the presence of all four 
deoxyribonucleotide triphosphates (Maniatis et al., 

5 Molecular Cloning: A Laboratory Manual , Cold 
jSpring Harbor Laboratory, 1982). The blunt-ended 
cDNA was treated with Eco Rl methylase and Bgat- 
ed to phosphorylated Eco Rl linkers using T* DNA 
iigase (Maniatis et al., ibid). The Hgated DNA prep- 

10 aration was exhaustively digested with Eco Rl to 
remove excess linker sequences and double- 
stranded DNAs greater than about 1000 base pairs 
in length were purified by neutral sucrose gradient 
centrifugation (Maniatis et al., ibid). Native XgtH 

75 DNA was ligated into concatemers, digested to 
completion with Eco Rl, and the 5' terminal phos- 
phates were removed by treatment with bacterial 
alkaline phosphatase. The pooled human liver 
cDNA was ligated with the phage DNA, packaged 

20 jn vitro (Maniatis et al., ibid), and used to infect E. 
coli Y1088 (Young and Davis, Science. 778- 
782, 1983). Approximately 14 x 10 fi primary phage 
plaques were generated in this library, composed 
of seven libraries of - 2 x 10 s plaques each. 

25 Greater than 90% of these were recombinants con- 
taining human DNA inserts, based on their lack of 
jS-gaiactosidase activity and characterization of 20 
random clones by Eco Rl digestion followed by 
agarose gel electrophoresis. The cDNA library, in 

30 the form of phage particles, was purified by cesium 
chloride gradient centrifugation and stored in SM 
buffer (Maniatis et aL, ibid). 

B. Screening of the human liver cDNA library for 
35 Factor VII clones. 

The human liver expression cDNA library de- 
scribed above was screened for specific antigen - 
(Young and Davis, ibid) using an 125 [-labeled mon- 

40 oclonal Factor VII antibody prepared by the method 
of Brown et al. GL BioLChem. 225 : 4980-4983, 
1980) using purified Factor VII. Screening of 6 x 
10* phage plaques identified one isolate, desig- 
nated XVII2115, which gave a positive response 

45 with the antibody. 

The phage clone XVII2115 was tested against 
two other anti-Factor VII monoclonal antibodies and 
a rabbit polyclonal antibody to Factor VII. Isolate 
XVII2115 gave a positive response to all these arrti- 

50 Factor VII antibodies. 

DNA was prepared from a plate lysate 
(Maniatis et al., pp. 65-66. 1982) of XVII2115. Di- 
gestion of this DNA with Eco Rl liberated an insert 
of 2139 base pairs. This insert was subcloned into 

55 M13 phage vectors (Messing, Meth. in Enzvmoloav 
101 : 20-77, 1983; and Norrander et al.. Gene 28 : 
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101-106, 1983) for chain termination dideoxy DNA 
sequencing (Sanger et al., Proc. Natl. Acad. ScjL 
U.S.A. 74- 5463-5476, 1977). This cDNA insert con- 
tains Pst i sites at positions 214, 839. and 1205 - 
(designated Pst la, Pst lb, and Pst Ic, respectively, 
in Figure 1a) and a Sma I site located at position 
61 1. The following Ml 3 templates were sequenced: 

1) full-length (2139 bases) Eco Rla — Eco 
Rib fragment in M13mp18 (designated clone 
F7-1); 

2) Pst la — Eco Rla 214 base fragment in 
M13mp19 (F7-2); 

3) Pst la — Pst lb 625 base fragment in 
M13mp18(F7-3); 

4) Pst lb — Pst la 625 base fragment in 
M13mp18(F7-7); 

5) Sma I — Pst lb 228 base fragment in 
M13mp10 (F7-8); 

6) Pst lb -* Pst Ic 366 base fragment in 
M13mp18(F7-9); 

7) Pst Ic — Pst lb 366 base fragment in 
M13mp18(F7-10); 

8) Pst Ic — Eco Rib 930 base fragment in 
M13mp19(F7-11);and 

9) Eco Rib — Eco Rla full-length fragment in 
M13mp18 (F7-12) (restriction site designa- 
tions refer to Figure 1a). 

The data confirmed the sequence on both 
strands for 91% of the coding region and 
15% of the 3' non-coding region and yielded 
single-stranded sequence information for the 
remaining 9% of the coding region and 85% 
of the non-coding region. 
Comparison of the amino acid sequence pre- 
dicted from the cDNA sequence with the known 
amino acid sequence data of Kisiel and McMuiten - 
(Thrombosis Research 22: 375, 1981) and the ami- 
no acid sequence shown below (Example 2) re- 
vealed an anomaly which could be explained by 
the absence of three nucleotides in the DNA se- 
quence near position 400. To obtain additional se- 
quence data, XVII2115 was digested with Eco Rl, 
and the Factor VII coding fragment was inserted 
into pUC 13 (Vieira and Messing, Gene 19: 259- 
268, 1982; and Messing, ibid) which had been 
digested with Eco Rl. The resultant recombinant 



plasmid, designated pUCVII2115, was digested 
with Xba I which cut at position 328. The digested 
sample was divided in half: half was labeled with 
a 32 P dCTP and DNA Polymerase I (Klenow frag- 

5 ment) (Englund, P.T., J. MoL Bio. 66 : 209, 1972); 
the other half was labeled with y* 2 ? ATP and 
polynucleotide kinase (Chaconas et a)., Biochem. 
Biophvs. Res. Comm. 66: 962, 1975). The labeled 
plasmids were then recut with Pst I to yield 113 

70 and 509 base pair fragments. Both strands of each 
of these were sequenced by the method of Maxam 
and Gilbert ( Meth. in Enzvmoloov 74: 560, 1980). 
The 113 base pair fragment was sequenced in its 
entirety and 210 base pairs of the 509 base pair 

75 fragment were sequenced. These sequences re- 
vealed three additional bases ( one C and two G's) 
which rendered the DNA sequence data in agree- 
ment with the protein sequence data, indicating 
that the previous anomalous results arose from 

20 compressions on the sequencing gel due to secon- 
dary structure involving G's and C's. The sequence 
of the last 9% of the coding region on both strands 
was also confirmed. 

Further analysis of the sequence of the 

25 pUCVII2115 insert confirmed that a portion of this 
cloned fragment encoded a sequence of 1 1 amino 
acids known to be at the cleavage site of Factor VII 
(Kisiel and McMullen, Thrombosis Research 22: 
375, 1981). Comparison of this sequence to Factor 

30 IX (Davie et al., ibid) and Factor X (Leytus et al., 
Proc. Natl. Acad. Sci. U.S.A. 81; 3699-3702, 1984) 
amino acid sequences suggested that the clone 
contained the sequence for Factor VII beginning at 
(approximately) nucleotides coding for amino acid 

35 36 of the mature Factor VII protein and continuing 
through approximately 1000 coding and 1100 non- 
coding nucleotides and poly A sequence. In addi- 
tion, it was found that this clone had frameshift 
mutations in the 3* coding portion. 

40 In order to obtain the correct 3' coding region, 
all 14 million clones of the seven \gt11 cDNA 
libraries were screened by plaque hybridization - 
(Benton and David, Science 196 : 180-181, 1977) 
with nick-translated cDNA of XVII2115 (Maniatis et 

45 al., pp. 109-112, 1982). 

Seven positive isolates were then screened by 
dideoxy sequencing of pUC plasmids into which 
the cDNA inserts had been subcloned (Wallace et 
al., Gene 16 : 21, 1981). The XgttL clones were 

so digested with Eco Rl and the Factor VII fragments 
were inserted into pUC13 which had been cleaved 
with Eco Rl. All except one of these were found to 
start at a position corresponding to base 212 of the 

55 
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insert in XVII2115; the one exception consisted only 
of 3' non-coding sequence. One of the clones 
starting at base 212 was selected for analysis and 
was designated clone pUCVM923. 

Because analysis of pl!CVLI2115 indicated the 
presence of frameshift mutations between positions 
657 and 815, pUCVU1923 was first analyzed in this 
region by Maxam-Gilbert sequencing. Plasmid 
pUCVII1923 was digested with Nar I (position 779 
in Figure 1a). The cut DNA was labeled with a 32 ? 
dCTP using DNA polymerase I (Klenow fragment) 
and subsequently digested with Ava I (which 
cleaves at the same site as Sma I in Figure l) and 
Taq I (site at 1059), yielding a Nar l-Ava 1166 bp 
fragment and a 200 bp Nar l-Taq I fragment Each 
of these was sequenced. A C, missing in 
pUCVII2115, was found at position 697 and another 
C. also missing in pUCVll2115 f was found at post- 
Won 798. 

The rest of the sequence of the coding region 
of pUCVH1923 was shown to be correct by sequen- 
cing by the dideoxy method on an M13 subclone 
of the entire insert of pUCVII1923. The Lac primer 
2C87 (Table 1) was used to sequence from posi- 
tion 212 (Figure 1a) to 512; primer ZC218 
(CTCTGCCTGCCGAAC) was used to sequence 
from 715 to 1140 and primer ZC217 
(ATGAGAAGCGCACGAAG) was used for sequen- 
cing from 720 to 350. Since the pUCVII2115 insert 
is correct from position 13 (positions 1-12 include 
an artificial linker) to 695, and pUCVII1923 is cor- 
rect from position 212 to the end, the two were 
spliced together to yield a molecule correct from 
position 13 (Figure 1a) to the end. A convenient 
point utilized for this splice is the Xba i site at 
position 328. The sequence of the spliced cor- 
rected* molecule is shown in Figure 1a. 

Because a full-length Factor VII clone was dif- 
ficult to obtain by cDNA cloning, three strategies 
were adopted to provide the missing coding se- 
quence and the necessary upstream processing 
and signal sequences. The first strategy was to 
obtain the needed sequence from a human 
genomic DNA library or through additional screen- 
ing of cDNA libraries. The second approach was to 
synthesize the necessary 5' coding sequence, 
based on the amino acid sequence data for Factor 
VII (Example 2) and the published sequences of 
the genes encoding vitamin K-dependent clotting 
factors (Kurachi and Davie, ibid; and Davie et a!.. 
ibid), and join this to a portion of the prepro se- 
quence of Factor DC The third strategy relies on 
the functional homology of the amino terminal re- 
gions of Factor VII and Factor IX. A sequence was 



constructed which comprised the coding regions 
for the leader and amino-terminal portion of Factor 
IX. This was then fused in the proper orientation to 
the partial Factor VII cDNA. 

s In order to obtain DNA sequences that com- 

prise the entire DNA sequence of Factor VII, an 
attempt was made to isolate the remaining 5* DNA 
sequence. This was accomplished through the utili- 
zation of the 5* terminal 0.3 kb EcoRI-XbaJ frag- 

io merit from the cDNA insert of XVII21 15 to screen a 
cDNA library comprising 2x10* phage. The library 
was constructed using poly(A) mRNA from HepG2 
cells following an adaptation of the method of 
Gubler and Hoffman (Gene 25: 263-269, 1983). The 

15 RNA was reverse transcribed to generate first 
strand cDNA, followed by second strand synthesis 
using DNA polymerase I and RNase H. Following 
EcoRI methylation and passage over a Sepharose 
6B column, the DNA termini were blunted with T» 

20 DNA polymerase. EcoRI linkers were added and 
excess linkers were removed by digestion with 
EcoRI and chromatography on Sepharose CL 2B. 
The DNA in the void volume was collected and 
ligated to Xgt11 which had been digested with 

25 EcoRI and treated with calf intestinal phosphatase. 
The DNA was packaged and infected into E. cofi 
Y1088. Several positives were detected, and the 
EcoRI fragments were subsequently subcioned into 
M13 phage vectors for dideoxy sequencing using 

30 either the M13 universal primer or Factor VII spe- 
cific oligonucleotides." 

From these, three new cDNA clones of Factor 
Vlf were obtained, and their sequences completely 
determined. The largest of these cDNAs, from a 

35 clone designated XVII2463, was found to contain 
the entire coding sequence for Factor VII. This 
clone included a 35 nucleotide 5* untranslated re- 
gion, 180 nucleotides coding for a 60 amino acid 
leader, 1218 nucleotides coding for the 406 amino 

40 acid mature protein, a stop codon, 1026 
nucleotides of 3* untranslated sequence, and a 20 
base poly(A) tail (beginning at position 2463). This 
cDNA has now been sequenced in its entirety on 
both strands. A comparison of it with two cDNAs 

45 isolated earlier from clones XV1I2115 and XW1923 
revealed that clone XVII2463 contains an additional 
321 nucleotides upstream of the insert in XV1I2115 
and 519 nucleotides upstream of the insert in 
XV1I1923. The overlapping Factor VII sequences of 

so XVII2463 and these two previous cDNAs agree, 
except that the cDNA of XVII2463 does not contain 
single base deletions at positions 1005 and 1106. 
which were detected in the cDNA of XVII2115. 
Thus, XVII2463 contains, on a single EcoRI frag- 

55 ment, a Factor VII cDNA coding for Factor VII 
leader and mature protein sequences. 
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An additional cDNA, XVU565, was isolated and 
found to contain 5 f terminal Factor VII sequences, 
but was truncated within the coding sequences. Its 
5' end maps at nucleotide 9 (Figure 1b). 

When compared with full-length XVII2463, 
XVII565 was found to lack a sequence correspond- 
ing to one exon-like region within the leader se- 
quence. Bases 100-165 are absent from XVII565 - 
(Figure 1b). The absent sequences correspond pre- 
cisely to one exon-like region by comparison with 
genomic sequence data (as described in Example 
III). Thus, the XVII565 structure may be a con- 
sequence of alternative splicing events in the lead- 
er sequence. 

The leader encoded by XVII2463 is exception- 
ally long (60 amino acids) and has a very different 
hydrophobicity profile when compared with Factor 
IX, protein C and prothrombin. This leader contains 
two Mets, at positions -60 and -26. Initiation most 
likely begins at the first Met, since a hydrophobic 
region, typical of signal peptides, follows the Met at 
position -60, but not the Met at -26. it is interesting 
that the absent sequences in XVII565 , which cor- 
responds precisely to an exon-like region in the 
genomic clone, results in a 38 amino acid leader 
with a hydrophobicity pattern more analogous to 
the above proteins. 

Example 2: Animo Acid Sequence of Human Factor 

vit. 

The elucidation of the amino acid sequence of 
human Factor VII was desired in order to confirm 
the identity of putative cDNA clones, substantiate 
the sequence of Factor VII cDNA, provide informa- 
tion allowing for the synthesis of specific 
oligonucleotide probes to screen cDNA and 
genomic libraries for clones containing the 5' se- 
quence, and to construct a synthetic fragment en- 
coding the amino-terminal portion of Factor VII. 
Although limited amino acid sequence was pro- 
vided by Kisiel and McMullen (ibid), more informa- 
tion was needed. 

Purified human Factor Vila (Kisiel and McMul- 
len, ibid) was reduced and carboxymethylated by 
the method of Crestfield et al. a i Biol. Chem. 238: 
622, 1963. The light and heavy polypeptide chains 
of carboxymethylated Factor Vila were separated 
by high-performance liquid chromatography - 
(HPLC) on a Micro Pak C18 reverse phase column 
(Varian Corp.) by generating a gradient of 0.1% 
TFA in distilled water (A) and 0.1% TFA in acetoni- 
trile (B) from 0-40% B in 5 minutes, 40-80% B in 
25 minutes and 80-100% B in 5 minutes. Approxi- 
mately 300 picomoles of each peptide chain were 
analyzed by automated Edman degradation using a 



Gas-Phase Protein Sequencer (Applied 
Biosystems, Inc.). Eighteen and 29 residues were 
identi fied at the amino-termini of the heavy and 
light polypeptide chains, respectively. The amino- 

5 terminal sequence of the heavy chain of Factor Vila 
was consistent with that encoded by cDNA clone 
pUCVII2115 (Figure 2b). Amino acid residues are 
designated within Figures 2a and 2b by single 
letter code as follows: A, alanine; C, cysteine; D, 

10 aspartic acid; E, glutamic acid; F, phenylalanine; G, 
glycine; H, histidine; I,, isoleucine; K, lysine; L, 
leucine; M, methionine; N, asparagine; P, proline; 
Q, glutamine; R, arginine; S, serine; T, threonine; V, 
valine; W, tryptophan; Y, tyrosine; X indicates an 

rs unknown residue and " indicates that the Gla resi- 
dues (7) were assigned by homology to the struc- 
tures of other known clotting factors and by the 
absence of any other phenylthiohydantoin-amino 
acid at those positions. The gaps (-) are placed to 

20 provide the best alignment among the sequences. 
In addition, the information indicated that the amino 
acids at positions five and nine were lysines and 
not threonine and arginine, respectively, as pre- 
viously reported (Kisiel and McMullen, ibid). The 

25 sequence analyses of the light chain of Factor Vila, 
which originates from the amino-terminal region of 
Factor VII, fell short by approximately 6 residues to 
overlap with the structure encoded by the 5' end of 
cDNA clone pUCVII2115. 

30 To obtain additional sequence data, two 
nanomoles of the carboxymethylated light chain 
were digested for 12 hours by bovine chymotrypsin 
(1:100 w/w, enzyme: substrate) in 0.1 M ammo- 
nium bicarbonate, pH 7.8, at 37°C. The generated 

35 fragments were purified by HPLC on a Micro Pak 
C18 reverse phase column using the above sol- 
vents in a gradient of 0-30% B in 5 minutes, 30- 
60% B in 25 minutes and 60-80% B in 10 minutes. 
Peptides were identified by their U.V. absorption at 

40 220 and 280 nm. Lyophiiized peptides - 
(approximately 1 nanomole each) were analyzed by 
Edman degradation. The results (Figure 2b) con- 
firmed much of the cDNA sequence in the cor- 
responding region of clone pUCVII2115. In total, 

45 113 of 152 residues (75%) of the light peptide 
chain of Factor Vila were identified. This sequence 
is identical to that encoded by the known cDNA 
structure. Indirect evidence indicates Asn 145 is a 
site of carbohydrate attachment. 

50 
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Example 3: Cloning of the genomic Factor VII se- 
quence. 

As one approach to providing the 5' end se- 
quence lacking from the cDNA, a lambda phage 
library containing human fetal liver DNA (Lawn et 
aL. CeH 15 : 1157-1174) was screened with nick 
translated Factor VII cDNA. A portion of the 
genomic library was plated on E. ooK LE392 (ATCC 
33572) to produce a total of 7.2 x 10* plaques 
(Maniatis et aL t ibid, pp. 320-321). The phage 
plaques were adsorbed from the plates onto ni- 
trocellulose and hybridized with the ^P-labefed 
cDNA according to the procedure of Benton and 
Davis (Science 196 : 180, 1977). Eight clones were 
obtained and plaque purified. 

Using a DNA fragment (Eco Rla-Xba I, Figure 
1) from the 5* end of the Factor VII cDNA - 
(XVII2115) and standard techniques (Maniatis et a!., 
ibid) those genomic clones containing 5* end se- 
quences were identified. These phage were des- 
ignated 7m1, 7m2 and 7m3. DNA was prepared 
from these recombinant phage and preliminary re- 
striction endonuclease maps derived. Phage 7m1, 
which gave the strongest hybridization signal, was 



used to generate a more extensive restriction map 
and to place the Eco Rl-Xba I cDNA sequences on 
this map by Southern blotting (Southern, J. MoL 
Bioi.98: 503, 1975). 

s In order to determine if phage 7m1 contained 

the DNA sequences encoding the amino terminal 
amino acids of the Factor VII protein, Southern 
blots of phage DNA restriction digests were 
hybridized with mixtures of oligonucleotides whose 

10 sequences were deduced from the Factor VII ami- 
no terminal amino acid sequence. Oligonucleotides 
ZC188, ZC360, and ZC401 (Table 1) were radioac- 
tively labeled with T* polynucleotide kinase and 
hybridized to the phage DNA blots at a few de- 

75 grees centigrade below their Tm (Wallace, R. B., et 
aL, Nuc. Acids Res. 6: 3543-3557, 1979). The re- 
sults of this analysis indicated that a 3.7 kb Sst I 
fragment of 7m1 contained sequences hybridizing 
to these oligonucleotides. This Sst I fragment was 

20 subdoned into M13 for DNA sequence analysis. 
Results obtained using ZC360 as sequencing prim- 
er identified a region approximately 60 nucleotides 
in length, which corresponded to the ami no-termi- 
nal protein sequence data. 
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TABLE 1 



Oligonucleotide • Sequence 



ZC87 TCC CAG TCA CGA CGT 



T G A 

ZC188 GCC GGG CTCA CTC CTC CA GAA GGC GTTGG 

C A G 



ZC212 


GAC 
TCA 


CTG 
TGG 


CAG 


GAT 


CCA 


TGC 


AGC 


GCG 


TGA 


ACA 


TGA 


ZC213 


GAG 
GCG 


GCC 
CTG 


TGG 


TGA 


TTC 


TGC 


CAT 


GAT 


CAT 


GTT 


CAC 


ZC217 


ATG 


AGA 


AGC 


GCA 


CGA 


AG 












ZC218 


CTC 


TGC 


CTG 


CCG 


AAC 














ZC235 


GAT 


CCA 


TGC 


AGC 


GC 














ZC249 


AGA 


ACA 


GCT 


TTG 


TTC 


TTT 


CA 










ZC275 


GCC 


CCC 


ATT 


CTG 


GCA 














ZC286 


CCA 


AAG 


AGG 


GCC 


AAC 


GCC 


TTC 


CTG 


GAG 


GAG 


AGA 




CCT 


GGG 


AGC 


CTG 


GAG 


AGA 


GAG 


TGT 


ATT 


GAG 


G 


ZC287 


AAT 


ACA 


CTC 


TCT 


CTC 


CAG 


GCT 


CCC 


AGG 


TCT 


CTC 




CTC 


CAG 


GAA 


GGC 


GTT 


GGC 


CCT 


CTT 


TGG 






ZC288 


AGC 
TCG 


AGT 
AGG 


GTA 
CCA 


GCT 
GCG 


TCG 
ACG 


AGG 


AGA 


ACA 


GAG 


AGG 


TTT 


ZC289 


AAT 


TCG 


TCG 


CTG 


GCC 


TCG 


AAA 


ACC 


TCT 


CTG 


TTC 




TCC 


TCG 


AAG 


CTA 


CAC 


TGC 


TCC 










ZC333 


CAG 


CTT 


CGT 


CCT 


GTC 


GCT 


GGC 


CTC 








ZC336 


CCT 


CTT 


TGG 


GCC 


TGG 


TGA 












ZC360 


C 

CA 

T 


C 

TC 

T 


C 

TC 

T 


C 

TC 

T 


G 

TT 

A 


CA 












ZC40-1 


CGT 
CTC 


AGC 
CTC 


GTT 
GAA 


CAG 
GCT 


GCC 
ACA 


CTC 
C 


GAA 


GAT 


CTC 


GCG 


GGC 
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Since genomic clone 7m1 was known to con- 
tain 7kb of sequences upstream of exon 2, this 
clone was anticipated to encode Factor VU 5* - 
untranslated sequences and the leader sequences 
up to the amino acid position -17. In order to 



confirm that exon 1 was encoded within genomic 
clone 7 ml, the leader sequence information from 
clones XVII2463 and XVII565 was used to design 
oligonucleotides ZC528 and ZC529 (shown below). 



ZC528 



5f TCA ACA GGC AGG GGC AGC ACT GCA GAG ATT 3 ' 



ZC529 



51 TTC CAC GGC ATG TCC CGT GET TCT CCT CCT 3 ' 



These were used to probe 7m 1 DNA, and a subl- 
cone, 7SD, was found that hybridized to both 
oligonucleotides. Exon 1 was determined to be 
composed of two exonic sequences: exon 1a, 
which hybridized to 2C528 (corresponding to 
nucleotides 1 to 30 in XVII2463), and exon 1b, 
which hybridized to 2C528 (corresponding to 
nucleotides 119 to 148 in XVH2463). The intron 
sequences flanking both exons 1a and 1b have 
been sequenced: 1a contains a consensus splice 
donor sequence at the 3 f end of the exon, and 1b 
is flanked on each terminus with a consensus 
splice acceptor (upstream of 1b) or donor - 
(downstream of 1b) sequence. The position of exon 
1a within genomic clone 7m 1 has been precisely 
mapped, while that of exon 1b has been mapped 



20 



25 



30 



within a defined region. Exon 1b sequences are 
present in XVH2463, while XVII565 appears to be 
derived from RNA spliced between exon 1a and 
exon 2, looping out the 1b exonic sequence. 

A variety of 7m1 subclones in pUC and M13 
vectors were prepared to facilitate sequencing the 
remaining exons. Appropriate oligonucleotides de- 
signed from the cDNA sequences, which corre- 
spond to exons 1 through 7, were used to se- 
quence all but the last exon. The genomic se- 
quence corresponds exactly to the cDNA se- 
quences through these regions. In addition, the 
intron/exon boundaries for exons 1-7 have been 
determined, and most are now precisely mapped 
within clone 7m1. The intron sizes and positions 
within the Factor VII gene are listed in Table 2. 



35 



TABLE 2 

Intron/Exon Junctions in the Factor VII Gene 



Intron 



Amino Acid Position 



Intron Size (Kb) 



A 


-39 


>0.2 


B 


-17 


>1.0 


C 


37/38 


1.92 


D 


46 


0.068 


E 


84 


-2 


F 


131 


-1 


G 


167/168 


0.56 


H 


209 


1.31 
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Phage 7m 1 was known to lack the Factor VII 
3'-terminus, which includes exon 8. In order to 
obtain these sequences, a 12-13 kb-enriched Bam 
HI library in XL47.1 (Loenen and Brammer, Gene 
10: 249, 1980; Maniatis, et al. a ibid.), derived from 
human dermal primary fibroblast cells, was probed 
with two nick-translated Factor VII cDNA Pstl frag- 
ments (corresponding to sequences in exon 7, and 
to 3'-untranslated sequences). A clone, designated 
7DC1, was detected by both probes. Subsequent 
restriction endonuclease and Southern blot analysis 
established that clone 7DC1 overlaps with, and 
extends approximately 3 Kb beyond the terminus 
of, clone 7m1, and that it contains exon 8. The 3.9 
Kb (Xbal-BamHI) fragment from 7DC1 DNA con- 
taining exon 8 was subcloned into M13, and se- 
quence analysis was performed using 
oligonucleotides complementary to its 5' and 3' 
termini. The entire exon sequence is present in this 
clone. 

Example 4: Factor IX-Factor VII Hybrid Genes Con- 
taining a Synthesized Coding Sequence. 



A. Construction df a hybrid Factor IX leader-syn- 
thetic Factor VII 5 1 coding sequence. 

The second alternative for obtaining the 5' cod- 
ing sequence for Factor VII was synthesis of an 
appropriate double-stranded fragment, using a 
nucleotide sequence predicted on the basis of the 
amino terminal amino acid sequence of Factor VII, 
the amino acid sequences of other vitamin K-de- 
pendent clotting factors, and the known nucleotide 
sequences of other vitamin K-dependent clotting 
factor genes (Kurachi and Davie, ibid; Anson et at., 
EMBO J. 3: 1053-1060,1984; and Davie et al., ibid). 
In order to provide the necessary secretion and 
processing signals for secretion of a mature Factor 
VII analog, this synthetic fragment (the consensus 
sequence) was joined to one of two leader se- 
quences derived from a Factor IX cDNA clone. 
This strategy is outlined in Figure 3. 

A cDNA coding for human Factor IX was ob- 
tained from a library made with mRNA from human 
liver (Kurachi and Davie, ibid). The Factor IX se- 
quence was isolated from the pBR322 vector by 
digestion with Pst I and was inserted into the Pst I 
site of pUC13. This plasmid was designated FIX- 
pUC13. In order to remove the G-rich region which 
was present at the 5' end of the Factor IX insert as 
a result of cDNA cloning, a synthetic 
oligonucleotide adaptor was substituted for the 5' 
end of the cloned fragment. Oligonucleotides 
ZC212 and ZC213 (Table 1) were synthesized and 



annealed to generate a 22 base pair overlap, the 
fragment ends filled in and cut with appropriate 
restriction endonucleases, and the resulting frag- 
ment was joined to the Factor IX sequence. 

5 To construct the adaptor, 100 pmoles each of 
ZC212 and ZC213 were lyophilized and resuspen- 
ded in 10 ul of 10x kinase/ligase buffer (600 mM 
Tris pH 8.0, 100 mM MgCl a , 100 nM DTT) plus 86 
ul HjO. The annealing reaction was run at 65°C for 

70 10 minutes, the mixture was slowly cooled to room 
temperature and put on ice. To this mixture was 
added 4 ul of 2.5mM dNTP mix and 1 ul (8 units) 
T* DNA polymerase. The reaction was allowed to 
proceed 45 minutes at 14°C. Ten ul of 5 M 

15 NH«OAc was then added and the DNA was ex- 
tracted once with phenol/CHCI 3 , twice with CHCI 3 , 
and was precipitated with ethanol. The DNA was 
centrifuged and resuspended in 100 ul medium salt 
buffer (Maniatis et al., ibid, p. 100), digested with 9 

20 units Pst I and 8 units Cfo I, and extracted as 
above. 

The modified Factor IX sequence was then 
constructed by combining 0.16 pmoles of the syn- 
thetic Pst l-Cfo I adaptor fragment, 0.14 pmoles of 

25 a 1 .4 kb Cfo I-Bam HI Factor IX fragment from FIX- 
pUC13, and 0.14 pmoles of a 2.7 kb Bam Hl-Pst I 
pUC1 3 vector fragment in a 20 ul reaction contain- 
ing 60 mM Tris-HCI pH 7.5, 10 mM MgCl 2 ,-10 mM 
DTT, and 0.9 units T* ligase. The reaction was 

30 incubated for 3 h at room temperature and used to 
transform competent E. coli JM83 (Messing, Re- 
combinant DNA Technical Bulletin, NIH Publication 
No. 79-99, 2, No. 2, 43-48, 1979). The ceils were 
plated with 50 ul of 2% X-gal (5 bromo-4-chloro-3 

35 idolyl-0-D-gaiactoside) on L-broth containing 40 
ug/ml. ampicillin and incubated at 37°C overnight. 
White colonies were picked onto another plate con- 
taining ampicillin and grown at 37°C overnight. The 
colonies were blotted on Whatman 540 paper and 

40 the paper prepared for hybridization according to 
the method of Wallace et al. (Gene 16: 21, 1981), 
except the overnight incubation on chloramphenicol 
plates was omitted. The papers were incubated at 
44«C for 2 h in 0.9 M NaCI, 0.09M Tris-HCI pH 7.5, 

45 6 mM EDTA, 0.5% Nonidet P-40, 150 ug/ml E. coK 
tRNA. The papers were probed with 32 P-labeled 
ZC235 (Table 1), a 14-mer that is specific for the 
altered 5 f end sequence. Hybridization with 1-2 x 
10 6 cpm per filter was carried out at 44°C in the 

50 prehybridization buffer overnight. The filters were 
then washed 3 times in 6 x SSC, 0.1% SDS at 4°C 
and 3 times in 2 x SSC, 0.1% SDS at 44°C and 
exposed to X-ray film. Two positive clones were 
obtained. One of these clones was designated FIX 

55 (-G)-pUC13. 
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In order to confirm the sequence of the altered 
region of the Factor IX portion of the F1X(-G) — 
pUC13 construct, dideoxy sequencing directly on 
the pUC plasmid using the BRL reverse primer was 
performed using the method of Wallace et al., 1981 
(ibid) using a primer end labeled with poly- 
nucleotide kinase and y**P ATP by the method of 
Chaconas et al. (ibid). The sequence was as pre- 
dicted. 

The resulting recombinant plasmid contains 
three Hae III cleavage sites, the first at position 39 
in the Factor IX sequence (numbering is based on 
the published sequence of Anson et al. (ibid), be- 
ginning at the first ATG), the second at position 
130, and a third in the pUCl3 polyiinker. The site 
at 130 is a single base pair upstream from the 
codons for the Lys-Arg processing site of the prepd 
Factor IX molecule. In the final Factor IX-Factor VII 
hybrid constructs, the Factor IX leader sequence, 
terminated at position 39 or 130, was joined to a 
synthetic double-stranded fragment comprising the 
predicted consensus sequence and the last 3 
codons of the Factor IX leader sequence. 

The synthetic consensus fragment was pro- 
duced by joining oligonucleotides ZC286-ZC289 - 
(Table 1) to form a double-stranded fragment One 
hundred pmole of each oligonucleotide was 
lyophilized and resuspended in 20 ul of 1x kinase 
buffer and incubated overnight at 4°C; then heated 
at 65° C for 10 minutes. Two pools were made 
using the kinased oligonucleotides. Pool 1 con- 
tained 2C286 + ZC287; pool 2 contained ZC288 
+ ZC289. The pooled pairs were annealed 10 
minutes at 65°C, then cooled to room temperature 
over a period of 2 hours and placed on ice for 30 
minutes. 

The modified Factor DC fragment was removed 
from FIX(-G)— pUC13 as a Hind III-Eco R! frag- 
ment Approximately 20 ug of plasmid was di- 
gested with 30 units each of Hind HI and Eco RI in 
100 ul Hind III buffer (BRL) containing 4 ug RNase 
A at 37° C overnight The reaction was terminated 
by heating at 65° C for 10 minutes, and the vector 
and Factor IX fragments were electrophoresed on a 
1% agarose gel and purified by electro-elution. The 
Factor IX fragment was precipitated with ethanol. 
resuspended in buffer containing 400 ng/ul RNase 
A, and digested with 9 units of Hae III overnight at 
37°C. The Hind III-Hae ill 39 base pair Factor IX 
fragment was isolated from this digest by elec- 
trophoresis on a 1.5% agarose gel followed by 
electro-elution. To obtain the Hind III-Hae III 130 
base pair Factor IX fragment FIX-pUC13 was di- 
gested with Eco RI and Hind Hi and the Factor IX 
fragment isolated as above. Approximately 3 ug of 
this Hind IH-Eco RI fragment was digested with 6 



units of Hae III at 37°C and aliquots were removed 
at five minute intervals over 30 minutes into a 
solution containing 50 mm EDTA. The aliquots 
were pooled and the Hind III-Hae III 130 base pair 

5 fragment was purified by electrophoresis on a 5% 
acrylamide gel followed by electro-elution. 

The final Factor IX-consensus sequence hy- 
brids were prepared by joining, in a four-part liga- 
tion, oligonucleotide pools 1 and 2, Factor IX Hind 

10 III-Hae III (39 or 130 base pairs), and pUC13 Wnd 
III-Eco RI. The resulting plasmids were used to 
transform E. coji HB101 (ATCC 33694). Colonies 
were screened by digestion of DNA with Eco RI 
and Hind ill. The sequence comprising the 39 base 

75 pair Factor IX sequence joined to the synthetic 
consensus sequence is hereinafter referred to as 
mini-FIX-FVII. The plasmid containing this construct 
was designated pM7200(-C). The sequence com- 
prising the 130 base pair Factor IX sequence 

20 joined to the synthetic consensus sequence is re- 
ferred to as maxi-FlX-FVIL The plasmid containing 
this construct was designated pM7100(-C). The 
consensus sequence encodes a polypeptide com- 
prising the amino acid sequence Ala-Asn-Ala-Phe- 

25 Leu-GIa-Gla-Arg-Pro-Gly-Ser-Leu-GIa-Arg-GIa-Cys- 
Lys-Gla-Gln-Cys-Ser-Phe-Gla-Gla-AIa-Arg-GIarlle- 
Phe-GIa-Gly-Leu-Asn-Arg-Thr-Lys-Leu. 

B. Joining Factor IX-consensus sequence hybrid 
30 fragment to Factor VII cDNA clone. 

The Factor IX-consensus sequence hybrids 
(either mini or maxi) were joined to the 5* portion of 
the Factor VII cDNA and the vector pUCl3 in a 
35 three-part ligation (Figures 4 and 5). The vector 
fragment was produced by digesting 6 ug of 
PUC13 with 10 units each of Xba I and Hind HI in 
Hind III buffer containing RNase A (400 ng/ul). The 
mini-FIX-FVIi fragment was produced by digesting 
40 2 ug of pM7200(-C) with 10 units each of Hind III 
and Eco RI as above. The maxi-FIX-FVll fragment 
was similarly prepared from pM7100(-C). The 5 f 
portion of the Factor VII cDNA was prepared from a 
plasmid (pUCG705) comprising the Eco Rl-Xba I 5' 
45 fragment of pUCVII2115 subcloned into pUCl3 by 
digestion with Xba I and Eco RI. Digests were run 
at 37° C for 2 hours and the products were sepa- 
rated by electrophoresis on a 1.5% agarose gel. 
The desired fragments were electro-eiuted, extract- 
so ed with phenol/CHCIi and CHCI* and precipitated 
with ethanol. The three fragments, pUCl3/Xba I- 
Hind HI, Factor IX-Factor VII (mini or maxi)/Hind III- 
Eco RI, and 5' Factor VIl/Eco Rl-Xba I were then 
ligated in 20 ul of ligase buffer containing 2 ul 20 
55 mM ATP and 0.9 unit T« DNA ligase overnight at 
4°C. Colonies were screened by restriction analy- 
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sis with tfind ill and Xba I. The recombinant plas- 
mids containing the mini-and maxi-FIX-FVII se- 
quences were designated pM7200 and pM7100, 
respectively (Rgure 4). 

Due to the linker addition used in producing 
the Factor VII cDNA, modifications had to be made 
in the fusion sequences to generate correct in 
frame coding sequences. Both mini-and maxi-fu- 
sions contain an Eco Rl site at the junction be- 
tween the Factor IX-consensus sequence hybrid 
and the Factor VII cDNA which is an artifact of the 
cDNA cloning process. In addition, the mini-fusion 
requires the addition of a C to change the se- 
quence at the Hae III site from 5 'AGGCCA 3 ' to 
S AGGCCCA 3 * and establish the correct reading 
frame downstream of this sequence. These correc- 
tions were made by oligonucleotide-directed site 
specific mutagenesis, essentially as described for 
the two-primer method by Zoller and Smith - 
(Manual for Advanced Techniques in Molecular 
Cloning Course , Cold Spring Harbor Laboratory, 
1983). The mini-FIX-FVII fragment was removed 
' from pM7200 by digestion with Hind III and Xba I 
and inserted into M13mp19. The maxi-FIX-FVII 
fragment was purified from pM7lO0 and subcloned 
in a similar manner. The mutagenic primers ZC333 
and ZC336 (see Table I) were used for removal of 
the Eco Rl site and the base insertion, respectively. 
In each case, the universal primer ZC87 was used 
as the second primer. The mutagenic primers were 
phosphorylated by combining 40 pmoles of primer 
and 60 pmoles ATP with 1 unit of T 4 DNA kinase 
overnight at 60°C. To remove the Eco Rl site from 
the maxi-FIX-FVII hybrid, 1 ug of the M13 single- 
stranded template was combined with 20 pmoles 
each ZC333 and ZC87 in a total volume of 10 til. 
The primers were annealed to the template for 10 
minutes at 65 °C, cooled to room temperature for 5 
minutes, then placed on ice for 5 minutes. The 
primers were extended using DNA polymerase I - 
(Klenow fragment). To remove the Eco Rl site and 
correct the reading frame in the mini-FIX-FVII hy- 
brid, 1 ug of the appropriate M13 single-stranded 
template was combined with 20 pmoles each 
ZC333,. ZC336 and ZC87. Annealing and primer 
extension reactions were carried out as described 
above. Plaque lifts were screened with 32 P-labeled 
primer (ZC333 or ZC336) at 60°C and sequences 
confirmed by dideoxy sequencing. The resultant 
constructs, comprising the maxl-and mini-FIX-FVII 
sequences, were designated pM7111 and pM7211, 
respectively. 

The consensus sequence contains several re- 
gions which do not conform to the protein se- 
quence data obtained for Factor VII (Figure 2). In 
order to produce a sequence which encodes a 



polypeptide with greater homology to the amino- 
terminal portion of Factor VII, the consensus se- 
quence was altered by oligonucleotide-directed 
site-specific mutagenesis. The changes made were 

5 the insertion of Leu at position 8, substitution of lie 
for Lys at position 18 (numbers refer to the amino 
acid position after the inseertion at position 8), Asn 
for Ala at position 26, and the sequence Ala-Ser- 
Asp for Gly-Leu-Asn at positions 32-34 (based on 

70 tentative amino acid sequence data). 

The sequence changes at positions 8 and 18 
were made using pM7111 (sense strand) as tem- 
plate. Primers ZC352 f CCC AGG TCT CAG CTC 
CTC CAG 3 ) and ZC353 (*' CTG CTC CTC CTT 

75 ACA CTC TCT 3 ' ) were annealed to the template 
and extended as described above. The resultant 
phage clone was designated pM7114. The se- 
quence of the insert in pM7114 was confirmed by 
dideoxy sequencing. 

20 In a similar manner, the changes at positions 
26-34 were made on the pM7114 template (sense 
strand) using the mutagenic primer ZC366 ( s ' CAG 
CTT CGT CCT GTT CAG GCC CTC GAA GAT 
CTC GCG GGC CTC CTC GAA 3 ') and ZC87 (Table 

25 1) as second primer. The resultant construct was 
designated pM7115. The sequence of the entire 
550 bp insert in the M13 vector was determined by 
dideoxy sequencing and found to be correct. 

30 Example 5: Construction of Factor IX-Factor VII 
cDNA fusion. 

The Factor IX-Factor VII cDNA fusion was pre- 
pared using Factor IX cDNA obtained from a hu- 

35 man liver cDNA library as described by Kurachi 
and Davie (ibid) and the Factor VII cDNA sequence 
described in Example 1. 

The fusion point chosen for the hybrid protein 
was between amino acid + 38 (threonine) of Factor 

40 IX and the first lysine encoded by the Factor VII 
cDNA sequence. Such a protein would be encoded 
by a sequence consisting of the first 252 bp of the 
Factor IX cDNA sequence and all of the 
pUCVII2115 Factor VII cDNA sequence except the 

45 first two codons. To construct this hybrid se- 
quence, the Factor IX sequence was first fused to 
pUCVII2115 using convenient restriction sites. This 
fusion resulted in the plasmid FIXA/ll/12 (described 
below) which contains the first 310 bp of the Factor 

so IX cDNA joined to the entire Factor VII cDNA 
sequence. To achieve the precise junction desired 
for the hydrid protein, the intervening base pairs 
were removed by oligonucleotide-directed 
mutagenesis. 

55 
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Joining of the Factor IX cDNA sequence to 
Factor VII cDNA sequence was accomplished by 
ligating a 0.3 kb Hind III-Aha III fragment of FIX (- 
Q) — pUC13 (Example 4) to a 4.7 kb Sma l-Hind ill 
fragment from pUCVII2H5 (Figure 5). The Hind lll- 
Aha III fragment was prepared by digesting 3 ug of 
FIX(-G).-*pUC13 with 40 units of Hind III in 40 ul of 
medium salt buffer (Maniaiis et al. f ibid) at 37°C, 4 
hours. The volume was then increased to 100 ul of 
medium salt buffer, and 5 units of Aha III were 
added and the 37°0 incubation continued for 18 
hours. The DNA fragments were separated by elec- 
trophoresis in 1% agarose and the 0.3 kb band 
isolated as described above. A Sma I parital diges- 
tion of pUCVII2115 was obtained by incubating 3 
ug of pUCVII2115 at 25°C for 1 hour with 4.8 units 
of Sma I in a reaction volume of 30 ul. the reaction 
was stopped by a 15-minute incubation at 65 °C. 
The sample was then extracted once with an equal 
volume of phenol and ethanol precipitated. 

The precipitate was collected by a 10-minute 
microfuge spin, rinsed with 70% ethanol and air 
dried. The DNA was redissolved in 30 ul of me- 
dium salt buffer and digested with 30 units of Hind 
HI at 37°C for 3 hours. The DNA was subjected to 
electrophoresis in 0.7% agarose and the 4.7 kb 
Hind lll-Sma I fragment isolated as described 
above. Equimolar amounts of the two fragments 
(0.048 pmoles) were ligated in a 10 ul reaction 
containing 50 mM Tris-HCI pH 7.5, 10 mM MgCl 2i 
1 mM DTT, 1 mM ATP, and 3 units of T< DNA 
ilgase at 14°C for 3.5 hours and then used to 
transform competent E. coii RRI (ATCC 31343). 
The ceils were grown on ampicitiin plates and 12 of 
the resulting colonies were screened by restriction 
enzyme digestion for the presence of the desired 
plasmid construction. DNA from colony 12 
(FIXA/ll/12) gave the expected restriction enzyme 
digestion pattern and was used in the next step of 
the hybrid gene construction. 

The oligonucleotide-directed mutagensis proce- 
dure was performed on a single-stranded DNA 
template. Thus, it was necessary to clone the fused 
Factor IX/Factor VII sequences into M13mp19. To 
obtain a conveniently small DNA fragment, a 640 
bp Hind Ill-Xba I fragment was isolated from 
FDC/VII/12. This fragment contains 310 bp of the 5' 
end of Factor IX cDNA and 330 bp of the Factor VII 
sequence. The vector was prepared by digesting 1 
ug of M13mp19 RF DNA with 20 units of Hind III 
and 20 units of Xba I in 40 ul of medium salt buffer 
at 37°C for 18 hours. The DNA was subjected to 
electrophoresis in 1.2% agarose and the linear 6.4 
kb fragment isolated from the gel as described 
above. Five ug of F1X/VH/12 DNA was digested with 
10 units of Xba I in 40 ul of medium salt buffer at 



37° for 18 hours. Twenty units of Hind III were 
added and the digestion continued at 37°C for an 
additional 7 hours. The resulting fragments were 
separated by electrophoresis in 12% agarose and 

5 the 640 bp fragment eluted as above. Ten ng of 
linearized m13mp19 and 1 ng of the 640 bp frag- 
ment were ligated at 14°C for 1 hour and then 
used to transform competent E. roji JM101 - 
(Messing, Meth. in Enzvmoloov. ibid). The cells 

io were plated with X-gal and IPTG (Messing, Meth. in 
EnzvmoloQv . ibid) and eight light blue plaques 
were picked and used to infect 2.5 ml cultures of 
E. coI[ JM103 at A«oo = 0.3. After 18 hours' growth 
at 37° C, the cells were harvested by centrifugation 

75 in a room temperature clinical centrifuge and 20 ul 
of the supernatant which contains the M13 phage 
was mixed with 10 ug/1 ethidtum bromide. By 
comparison with known standards, each of the 
eight clones had an insert of approximailey the 

20 correct size. Single-stranded DNA was then pre- 
pared from 1.5 ml of the supematants as described 
by Messing (Meth, in Enzymoloov. ibid). This con- 
struct was then sequenced by the dideoxy method 
using the oligonucleotide ZC87 as a primer to 

25 confirm that the insert junction was correct One of 
the correct clones (#4) was used as a template in 
oligonucleotide-directed mutagenesis to produce a 
functional Factor IX- Factor VII fusion. 

The oligonucleotide ZC249, a 20-mer consist- 

30 ing of 10 bp of the desired Factor IX sequence and 
10 bp of the desired Factor VII sequence (Table I) 
was used as the mutagenic primer. The 
oligonucleotide * ZC87, which hybridizes, to the 
M13mp19 sequence, was used as the second 

35 primer. 

The mutagenesis procedure was modified from 
that of ZoIIer and Smith (ibid). For the annealing 
reaction, 20 pmoles of ZC249 were phosphorylated 
by incubating overnight at 4°C in 20 ul 60 mM 

40 Tris-HCI pH 8.0, 10 mM MgCI a , 1 mM DTT, 1mM 
ATP, 1 unit T< kinase. The reaction was stopped by 
incubation at 65°C for 15 minutes, and the sample 
was lyophlized. One pmole of single-stranded 
clone #4 template and 20 pmole of ZC87 were 

45 added in 10 ul annealing buffer (200 mM Tris-HCI 
pH 7.5. 100 mM MgCI 2 . 500 mM NaCI, 10 mM 
DTT). The sample was heated to 65°C for 10 
minutes, incubated at room temperature for 5 min- 
utes, and then placed on ice. Ten ul of the follow- 
so ing solution was prepared fresh and added to the 
sample: 20 mM Tris-HCI pH 7.5, 10 mM MgO, 10 
mM DTT, 1 mM dNTPs, 1 mM ATP, 0.15 unitsAji 
T. DNA ligase, 0.25 units/ul E. a)K DNA Poly- 
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merase I (Klenow fragment). The reaction was then 
incubated at 15°C for 3 hours and the sample used 
to transform competent E. coH JM101 (Messing, 
Meth. in EnzvmolQQv , ibid). 

The resulting plaques were lifted onto nitrocel- 
lulose and screened by hybridization to ^P-labeled 
ZC249. Dry BA85 filters (Schliecher & Schuell, 0.45 
urn) were laid onto the agar plate and the phage 
allowed to adsorb for 5 minutes. The filters were 
removed and allowed to dry for 5 minutes, placed 
on Whatman 3 MM paper, saturated in 0.5 M 
NaOH, 1.5 M NaCI for 5 minutes, air dried for 3 
minutes, placed on Whatman paper, saturated in 1 
M Tris-HCI pH 8, 1 .5 M NaCI, for 5 minutes, and 
air dried for 3 minutes. The Tris-HCI step was 
repeated and the filters were rinsed in 100 ml 6 x . 
SSC for 2 minutes at room temperature. After* air 
drying, the filters were baked at 80 °C for 2 hours 
and prehybridized at 47°C (Tm-4° of ZC249) over- 
night in 6.7 x SSC pH 6.5, 2 mg/ml E. coli tRNA, 
and 0.2% (w/v) each BSA, Ficoll, and polyvinyl- 
pryrolidine. 

After the prehybridization step, the filters were 
incubated with 2.5 x 10 s cpm/filter of labeled 
ZC249 in the same SSC hybridization buffer at 
47°C overnight. Following hybridization, the filters 
were washed 3 times, 5-10 minutes each, at room 
temperature in 6 x SSC and exposed to X-ray film. 
Putative positive plaques were replated and 
screened as above. Individual plaques were then 
picked, and single-stranded DNA was prepared and 
sequenced using ZC275 as a primer. The 
oligonucleotide ZC275 corresponds to a sequence 
40 bp in the 5* direction of ZC249 on the same 
strand (Tablel). 

Four positive plaques were identified. The en- 
tire insert in M13mp19 for one clone (FIXA/ll-9) was 
sequenced by the dideoxy method using the 
oligonucleotides ZC87 and ZC275 and determined 
to be correct The confirmed sequence is repre- 
sented by bases 1-567 in Figure 7. RF DNA from 
this clone was then used for the final step in the 
construction of the hybrid gene. 

Three fragments were used to make the final 
construction: the 0.6 kb Hind Ill-Xba I fragment 
from FIXA/ll-9 containing the fused IXA/II se- 
quences; a 1.7 kb Xba l-Bam HI Factor VII cDNA 
fragment from pUCVII1923; and a 2.7 kb Bam Hl- 
Hind III fragment of pUC13. Three ug of FIXA/II-9 
(RF DNA) were digested at 37° C for 6 hours with 
45 units of Xba I in a volume of 50 ul. The DNA 
was precipitated with ethanol, resuspended and 
digested at 37°C for 4 h with 50 units of Hind III. 
The sample was subjected to electrophoresis in 



1% agarose and the 0.6 kb band electro-eluted 
from the paper with 1.5 M NaCI, 50 mM Tris-HCI 
pH 8, 1 mM EDTA, phenol extracted and precipitat- 
ed with ethanol. 

5 To obtain the remaining Factor VII cDNA se- 

quence, 5 ug of pUCVII1923 was digested at 37° C 
for 3 hours with 36 units of Xba I in 40 ul of 
medium salt buffer. Then 8 ul of 10x high salt 
buffer, 28 ul of H 2 0, and 4 ui (40 units) of Bam HI 

w were added and the reaction incubated at 37°C for 
3 hours. The DNA fragments were separated by 
electrophoresis in 1% agarose and the 1.7 kb frag- 
ment isolated as described above. 

The vector fragment was prepared by digesting 

75 1 ug of pUC13 with 10 units of Hind ill in 20 ul of 
medium salt buffer at 37°C for 1 hour. Two ul of 
10x high salt buffer and 10 units of Bam HI were 
then added and the incubation continued for an- 
other 2 hours. The DNA was purified on a 1% 

20 agarose gel as described above. 

Equimolar amounts (approximately 0.56 
pmoles) of the three fragments were ligated at 
room temperature for 45 minutes in 10 ul of 50 mM 
Tris-HCI pH 7.5, 10 mM MgCU, 1 mM DTT, 1mM 

25 ATP and 3 units T« DNA ligase. The reaction mix- 
ture was used to transform competent E. coti 
JM83. The cells were plated on medium containing 
40 ug/ml ampicillin with 50 ul of 2% X-gal added to 
each plate. DNA was prepared from 7 white colo- 

30 nies and then screened by restriction enzyme di- 
gestion. One of the clones giving the correct pat- 
tern was designated FIXA/II — pUC13. 

Example 6: Expression of Biologically Active Factor 
35 VII Analogs. 

The mammalian cell expression vector pD2 
was chosen for expression of the FIXA/II gene in 
transfected animal cells. It was constructed from 

40 plasmid pDHFR-III (Berkner and Sharp, Nuc. Acids 
Res. 13: 841-857, 1985) in the following manner. 
The Pst I site abutting the DHFR cDNA in pDHFR 
111 was converted to a Bam HI site by conventional 
linkering (Scheller, R. H., Dickerson, R.E., Boyer, H. 

45 W., Riggs, A.D., and Itakura, K., Science 196 : 177- 
180, 1977). The pDHFR III DNA was incubated with 
10 mM Tris pH 7.6, 6 mM 0-MSH, 6 mM NaCI, 10 
mM MgCl a and 2.5 units Pst I for 10 minutes at 
37°C, followed by phenol extraction and ethanol 

50 precipitation. The Pst I cohesive termini were blunt 
ended using T« DNA polymerase. After phenol ex- 
traction and dialysis against 10 mM Tris pH 8.0, 1 
mM EDTA, 0.3 M NaCI, the DNA was ethanol 
precipitated. The DNA was resuspended in 20 ul 

55 1.4 mM ATP, 50 mM Tris pH 7.6, 10 mM MgCI 2 , 1 
mM dithiothreitol and then incubated with 5 ng of 
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T« polynucleotide kinase-treated Bam HI linkers - 
(New England Blolabs) and 200 units of T* poly- 
nucleotide ligase for 12 hours at 12°C, followed by 
phenol extraction and ethanol precipitation. The 
DNA was digested with 90 units of Bam HI at 37°C 
for 1 hour, followed by electrophoresis through a 
1.4% agarose gel. The 4.9 kb DNA fragment 
(corresponding to pDHFR III DNA lacking the 
DHFR cDNA and SV40 polyadenylation signal) was 
efectro-eluted and recircularized with poly- 
nucleotide ligase and then transfected into E. coji 
HB101. Ampicillin-sensitive colonies were screened 
by rapid prep analysis (Bimboim, H.C., and Doly, 
J M Nucleic Acids Research 7: 1513-1523, 1979) 
and the correct clone was grown up to generate a 
large-scale plasmid DNA preparation. 

The resultant plasmid was cleaved with 20 
units Bam HI and treated with 2.5 ug calf intestinal 
phosphatase and electrophoresed on a 1.4% 
agarose gel. Twenty-five ug of pSV40 (a clone of 
SV40 DNA inserted into the Bam Hi site of 
pBR322) were digested with 25 units of Bel I for 1 
hour at 5G°C, followed by the addition of 25 units 
of Bam HI, and the incubation continued for 1 hour 
at 37° C. This DNA was then electrophoresed on a 
1.4% agarose gel. The Bam Hl-cut vector (i.e., that 
lacking the polyadenylation signal) was joined to 
the SV40 DNA fragment (.14 to .19 map units - 
[Tooze, J. f ed., "DNA Tumor Viruses, Molecular 
Biology of Tumor viruses"]) containing the late 
polyadenylation signal by incubating the gel-puri- 
fied fragments (0.1 ug each) in 20 ul 50 mM Tris 
pH 7.6, 10 mM MgCI 2 , 1 mM dithiothreitol, 1.4 mM 
ATP and 100 units T« polynucleotide ligase for 4 
hours at 12°C, followed by transformation into E. 
coli RR1 . Positive colonies were identified by rapid 
prep analysis, and a large-scale plasmid prepara- 
tion of the correct DNA, pD2, was prepared. 

To make the Factor lX/VII expression construc- 
tion, 1 ug of pD2 was digested at 37° C for 1 hour 
with 20 units of Bam HI in 20 ul of high salt buffer. 
Twenty ul of 10 mM Tris-HCI pH 8, 1 mM EDTA 
and 0.1 unit of calf alkaline phosphatase - 
(Boeringer) were then added. The reaction was 
incubated at 37° C for 1 hour and stopped by 
heating to 75° C for 10 minutes. Ten ug of FIX/VU 
— pUCl3 was digested at 37° C for 2 hours with 
150 units of Bam HI in 150 ul of high salt buffer. 
The DNA fragments were separated by elec- 
trophoresis in 1.2% agarose and the 2.3 kb frag- 
ment was isolated. Equimolar amounts (0.015 
pmoles) of the 2.3 kb Bam HI fragment and the 
pD2 vector fragment were ligated at 14°C for 2.5 
hours as above. The reaction mixture was used to 
transform E. con RR1 cells, which were then plated 
on medium containing 10 ug/ml ampicillin. Plasmid 



DNA was prepared from 12 of the resulting colo- 
nies and screened by restriction enzyme digestion. 
One of the clones with the correct enzyme diges- 
tion pattern was designated RX/VII/pD2 (Figure 6). 
5 E. coli RR1 transformed with FlX/VII/pD2 has been 
deposited with ATCC under accession number 
53068. 

The procedure used to transfect baby hamster 
kidney (BHK) ceils (available from American Type 

70 Culture Collection, accession number CCL10) with 
FIX/V1l/pD2 was similar to published methods (for 
example, Wigler et a!., C§U 14: 725, 1978; Corsaro 
and Pearson. Somatic Cell Genetics 7: 603, 1981; 
Graham and Van der Eb, Virology 52: 456, 1973). 

75 The BHK cells were grown at 37° C, 5% CO* in 
Dulbecco's media (plus 10% heat-inactivated fetal 
calf serum and supplemented with glutamine and 
penicillin-strep-tomycin) in 60 mm tissue culture 
Petri dishes to a confluency of 20%. A total of 10 

20 ug DNA was used to transfect one 60 mm dish: 
3.75 ug of RXMI/pD2, 1^5 ug of pKOneo - 
(Southern and Berg, J. Mol. AddI. Genet I: 327- 
341, 1982) and 5 ug of salmon sperm DNA. The 
DNAs were precipitated in 0.3 M NaOAc, 75% 

25 ethanol, rinsed with 70% ethanol and redissoJved in 
20 ul 10 mM Tris-HCI pH 8, 1 mM EDTA. The DNA 
was combined with 440 ul H 2 0 and 500 ul of 280 
mM NaCI. 1.5 mM NaHPO*. 12 mM dextrose, 50 
mM HEPES pH 7.12. Sixty ul of 2 M CaCl* were 

30 added dropwise to the above mixture and the solu- 
tion let stand at room temperature for 30 minutes. 
The solution was then added to the cells and the 
cells returned to 37° C for 4 hours. The medium 
was removed and 5 ml of 20% DMSO in Duh 

35 becco's with serum were added for 2 minutes at 
room temperature. The dish was then washed rap- 
idly with 2 changes of medium and incubated in 
fresh medium overnight Twenty-four hours after 
the DNA was added, the medium was removed and 

40 selective medium added (10 mg/ml of G418, 498 
ug/mg, Gibco, in Dulbecco's with serum). After 10 
and 13 days, individual clones, representing cells 
that had incorporated the pKO-neo gene and were 
thus resistant to G418, were transferred to 96-weli - 

45 (or 24-well) plates and grown up for protein assays. 

Cells were grown in Dulbecco's plus 10% fetal 
calf serum containing 5 ug/ml vitamin K - 
(Phytonadione, Merck). The medium was separated 
from the cells and cellular debris by centrifugation, 

so and assayed for Factor VII polypeptide (by EUSA) 
and for biological activity. The cells were removed 
from the plates with trypsin, washed with fresh 
medium, centrifuged, and frozen at -20°C. The cell 

55 
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pellets were then thawed in PBS, pelleted, and 
resuspended In PBS containing 0.25% Triton X- 
100. Samples were diluted and assayed for poly- 
peptide and activity. 

The ELISA for Factor VII was done as follows. 
Two hundred microliters of a monoclonal antibody 
against human Factor VII (5 ul/ml in 0.1 M Na 2 C0 3 
pH 9.6) were incubated in each well of a 96-well 
microtiter plate 2 hours at 37°C. The wells were 
then incubated with 220 ul of 1% bovine serum 
albumin (BSA) and 0.05% Tween 20 in PBS pH 72 
2 hours at 37°C. The plates were rinsed with H 2 0, 
air dried, and stored at 4°C. To assay samples, 
200 ul samples were incubated 1 hour at room 
temperature in the antibody-coated wells. The wells 
were then rinsed four times with 200 ul PBS con- 
taining 0.05% Tween 20. The wells" were then 
incubated for 1 hour at room temperature with 200 



ul of an IgG fraction of rabbit polyclonal antiserum 
against Factor VII (5 ug/ml in PBS containing 1% 
BSA and 0.05% Tween 20). This was followed by 
incubation with goat anti-rabbit IgG coupled to al- 

s kaline phosphatase. The wells were then rinsed 
four times with PBS containing 0.05% Tween 20. 
To the wells were added 200 ul p-nitrophenyl phos- 
phate (30 mg) dissolved in diethanolamine buffer • 
(96 ml per liter) pH 9.8 containing 56 mg/I MgCI 2 . 

10 The enzyme reaction was done at 37 °C and the 
development of a yellow color was monitored at 
405 nm using an ELISA plate reader. Results ob- 
tained for cell media are given in Table 3. 

Factor VII biological activity was assayed by 

75 the one-stage clotting assay described by Quick - 
(Hemorraoic Disease and Thrombisis. 2nd ed. t Leat 
Febiger, Philadelphia, 1966). Results obtained for 
cell media are given in Table 3. 

20 



TABLE 3 

Cells/ml Factor VI I Factor VII 

Day (xl0~ 4 ) polypeptide ng/ml activity (ng/ml) 

1 2.9 

2.7 25 6.0 

2 1.9 

2.8 47 15.9 

3 1.96 

2.26 160 93 

4 4.71 

4.14 550 300 

5 8.79 

11.28 725 . 531 

6 5.1 

8.4 975 600 
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Example 7: Expression of Factor IX 

Fourteen ug of F1X(-G) -» pUC13 were di- 
gested with 30 units of Bam HI in 30 ul of high salt s 
buffer for 3 hours at 37°C. The DMA was then 
subjected to electrophoresis in 1% agarose and the 
1.4 kb band contining the Factor IX sequence was 
isolated from the gel. 

Three ug of the vector pD2 were digested with 10 
30 units of Bam HI in 30 ul high salt buffer for 3 
hours at 37° C. The DNA was subjected to elec- 
trophoresis in 1% agarose and the linear 1.5 kb 
fragment isolated. The DNA was then treated with 
0.12 units calf alkaline phosphatase in 30 ul of 10 is 
mM Tris-HCI pH 8, 1 mM EDTA for 30 minutes at 
37°C. The salt was adjusted to 0.3 M NaOAc and 
the sample extracted twice with phenol, once with 
chloroform and the DNA was ethanol precipitated. 
The pellet was rinsed in 70% ethanol, dried and 20 
redissolved in 20 ul 10 mM Tris-HCI pH 8, 1 mM 

TABLE 



EDTA. Equimolar amounts (0.02 pmoles) of the two 
fragments were ligated with 10 units of T* DNA 
ligase as described above. The reaction mixture 
was used to transform E coli RR1 cells. DNA from 
twelve of the resulting ampicillin-resistant colonies 
was screened by restriction enzyme digestion. One 
of the clones with the 1.4 kb fragment inserted in 
the correct orientation was designated as FDC(-G)- 
/pD2. E coji RR1 transformed with FlX(-G)/pD2 has 
been deposited with ATCC under accession num- 
ber 53067. 

BHK cells were co-transfected with FIX(-G)/pD2 
and pKO-neo as described above. Drug-resistant 
cells were selected and prepared for EUSA and 
activity assay as described in Example 6. 

The assay for biological activity is based on 
the ability of Factor IX to reduce the clotting time 
of plasma from Factor IX-deficient patients to nor- 
mal. It was done as described by Procter and 
Rapaport (Amer. J. Clin. Path. 36: 212, 1961). Re- 
sults are shown in Table 4. 



Factor IX 
Cells/ml polypeptide (ng/ml) 
Day (xiQ- 4 ) supernatant pellet 

1 1.65 



2 2.66 57 20 

45 20 

3 9 .69 150 60 

120 60 

4 14.79 475 160 

225 140 

5 50.85 875 250 

1000 260 



The amount of Factor IX polypeptide was de- 
termined by EUSA essentially as described in Ex- 
ample 6 using polyclonal rabbit antisera to Factor 
IX. Following the incubation of the wells with^the 



Factor IX % active 

activity (ng/ml) protein in 
in supernatant supernatant 



27 50% 
24 

72 58% 
84 

198 50% 
150 

408 45% 
438 

Factor IX-containing samples, the wells were rinsed 
and incubatedl hour at room temperature with 200 
ul of affinity purified rabbit polyclonal anti-Factor IX 
conjugated to alkaline phosphatase diluted 1:1000 
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in PBS containing 1% BSA and 0.05% Tween 20. 
The wells were then rinsed four times with PBS 
containing 0.05% Tween 20, and enzyme substrate 
was added as above. Incubations were run at 4°C 
overnight or 37 °C for 2 hours. 

As shown in Table 4, 70-80% of the Factor IX 
polypeptide is secreted into the media, and about 
50% of this is biologically active. No Factor IX 
activity was detected in the cell pellets. 

Highest levels of activity were achieved by 
supplementing the cell culture medium with vitamin 
K (phytonadione, Merck) at concentrations of 1-10 
mg/ml. 

Several additional analyses were performed to 
demonstate that the cells were secreting authentic 
Factor IX. Samples containing Factor IX activity 
according to the above assay were incubated with 
Factor Vlll-deficient plasma but did not affect the 
clotting time, indicating that the activity was due to 
authentic Factor IX rather than a non-specific clot- 
ting agent. This conclusion was further verified by 
depletion of Factor IX activity from the samples 
with a specific antibody. Ninety-seven to ninety- 
eight percent of the Factor IX activity was im- 
munoprecipitated from cell supernatants with a rab- 
bit polyclonal antibody against Factor IX. This anti- 
body also precipitated over 99% of the Factor IX 
activity from normal plasma. No Factor IX activity 
was removed from the supernatants by rabbit poly- 
clonal antibody to erythropoietin. 

Example 8: Construction of an expression vector 
for Factor VII. 

An expression vector comprising the synthetic 
Factor VII 5' coding region joined to the partial 
Factor VII cDNA was constructed. The vector, des- 
ignated pM7135, was generated by inserting the 
Factor IX leader -5* Factor VII sequence from 
pM7115 and the 3* Factor VII sequence from 
FlX/VII/pD2 into plasmid pD3, which comprises the 
SV40 enhancer and the adenovirus 2 major late 
promoter and tripartite leader. 

Plasmid pD3 was generated from plasmid 
pDHFRIII. The Pst I site immediately upstream 
from the DHFR sequence in pDHFRIII was con- 
verted to a Bel I site by digesting 10 ug of plasmid 
with 5 Units of Pst I for 10' at 37°C in 100 ul buffer 
A (10 mM Tris pH 8, 10 mM MgCI,. 6 mM NaCI, 
7m M 0-MSH). The DNA was phenol extracted, 
EtOH precipitated, and resuspended in 40 ul buffer 
B (50 mM Tris pH 8, 7 mM MgCI„ 7 mM 0-MSH) 
containing 10 mM dCTP and 16 units T4 DNA 
polymerase and incubated at 12°C for 60 minutes. 
Following EtOH precipitation, the DNA was ligated 
to 2.5 ug kinased Bel I linkers in 14 ul buffer C (10 



mM Tris pH 8, 10 mM MgCI 2 , 1 mM DTT. 1.4 mM 
ATP) containing 400 units T4 polynucleotide ligase 
for 12 hours at 12°C. Following phenol extraction 
and EtOH precipitation, the DNA was resuspended 

5 in 120 ul buffer D (75 mM KCI, 6 mM Tris pH 7.5, 
10 mM MgCl 2l 1 mM DTT), digested with 80 units 
Bel I for 60 minutes at 50°C, then electrophoresed 
through agarose. Form III plasmid DNA (10 ug) was 
isolated from the gel, and ligated in 10 ul buffer C 

10 containing 50 units T4 polynucleotide ligase for 2 
hours at 12°C, and used to transform E coH 
HB101. Positive colonies were identified by rapid 
DNA preparation analysis, and plasmid DNA - 
(designated pDHFR') prepared from positive colo- 

75 nies was transformed into dAM* E. coji. 

Plasmid pD2* was then generated by cleaving 
pDHFR* (15 ug) and pSV40 (25 ug) in 100 ul buffer 
D with 25 units Bel I for 60 minutes at 50°C, 
followed by the addition of 50 units Bam HI and 

20 additional incubation at 37°C for 60 minutes. DNA 
fragments were resolved by agarose gel elec- 
trophoresis, and the 4.9 kb pDHFR* fragment and 
0.2 kb SV40 fragment were isolated. These frag- 
ments (200 ng pDHFR' DNA and 100 ng SV40 

25 DNA) were incubated in 10 ul buffer C containing 
100 units T4 polynucleotide ligase for 4 hours at 
12°C, and the resulting construct (pD2*) used to 
transform E. coH RRI. 

Plasmid pD2' was modified by deleting the 

30 "poison" sequences in the pBR 322 region (Lusky 
and Botcham, Nature 293 : 79-81, 1981). Plasmids 
pD2' (6.6 ug) and pML-1 (Lusky and Botcham, 
ibid) (4 ug) were incubated in 50 ul buffer A with 10 
units each Eco Rl and Nru I for 2 hours at 37° C, 

35 followed by agarose gel electrophoresis. The 1.7 
kb pD2' fragment and 1.8 kb pML-1 fragment were 
isolated and ligated together (50 ng each) in 20 ul 
buffer C containing 100 units T4 polynucleotide 
ligase for 2 hours at 12°C, followed by transforma- 

40 tion into E. coH HB101. Colonies containing the 
desired construct (designated A pD2) were iden- 
tified by rapid preparation analysis. Ten ug of A 
pD2 were then digested with 20 units each Eco Rl 
and Bgl II, in 50 ul buffer A for 2 hours at 37° C. 

45 The DNA was electrophoresed through agarose, 
and the desired 2.8 kb fragment (fragment C) com- 
prising the pBR322, 3 1 splice site and poly A 
sequences was isolated/ 

To generate the remaining fragments used in 

so constructing pD3, pDHFRIII was modified to con- 
vert the Sac II (Sst II) site into either a Hind III or 
Kpn I site. Ten ug pDHFRIII were digested with 20 
units Sst II for 2 hours at 37°C, followed by phenol 
extraction and ethanol precipitation. Resuspended 

55 DNA was incubated in 100 ul buffer B containing 
10 mM dCTP and 16 units T4 DNA polymerase for 
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60 minutes at 12°C, phenol extracted, dialyzed, 
and ethanol precipitated. DNA (5 ug) was ligated 
with 50 ng kinased Hind III or Kpn I linkers in 20 ul 
buffer C containing 400 units T4 DNA ligase for 10 
hours at 12°C, phenol extracted, and ethanol 
precipitated. After resuspension in 50 ul buffer A, 
the resultant plasmids were digested with 50 units 
Hind 111 or Kpn I, as appropriate, and electrophores- 
ed through agarose. Gel-isolated DNA (250 ng) was 
ligated "in 30 ul buffer C containing 400 units T4 
DNA ligase for 4 hours at 12°C and used to 
transform E. coH RRl. The resultant plasmids were 
designated pDHFRIII (Hind III) and pDHFRIII (Kpn 
I). A 700 bp Kpn l-Bgl II fragment (fragment A) was 
then purified from pDHFRIII (Kpn I) by digestion 
with Bgl II and Kpn I followed by agarose gel 
electrophoresis. 

The SV40 enhancer sequence was inserted 
into pDHFRIII (Hind III) as follows: 50 ug SV40 DNA 
was incubated in 120 ul buffer A with 50 units Hind 
ill for 2 hours at 37°C, and the Hind III C SV40 
fragment (5089-968 bp) was gel purified. Plasmid 
pDHFRIII (Hind III) (10 ug) was treated with 250 ng 
calf intestinal phosphatase for 1 hour at 37° C, 
phenol extracted and ethanol precipitated. The lin- 
earized plasmid (50 ng) was ligated with 250 ng 
Hind III C SV40 in 16 ul buffer C for 3 hours at 
12°C, using 200 units T4 polynucleotide ligase, 
and transformed into E. co|i HB101. A 700 base 
pair Eco RI-Kpn I fragment (fragment B) was then 
isolated from this plasmid. 

For the final construction of pD3, fragments A 
and B (50 ng each) were ligated with 10 ng frag- 
ment C with 200 units T4 polynucleotide ligase for 
4 hours at 12°C, followed by transfection of E. coli 
RRl. Positive colonies were detected by rapid prep- 
aration analysis/ and a large-scale preparation of 
pD3 was made. 

Expression vector pM7135 was then construct- 
ed. The replicative form of pM7115 was digested 
with Bam HI and Xba I and the 550 base pair 
fragment comprising the Factor IX leader and 5* 
Factor VII sequence was gel purified. Plasmid 
FIX/Vll/pD2 was digested with Xba I and Bam HI 
and the 1700 bp fragment comprising the 3' portion 
of the Factor VII cDNA was gel purified. Plasmid 
pD3 was digested with Bel I, treated with calf 
alkaline phosphatase, and the three fragments 
joined in a triple ligation. The resultant constructs 
were screened for the presence of a 2000 base 
pair Xba I fragment A plasmid having the correct 
orientation was selected and designated pM7135 - 
(Figure 8). 



Example 9: Expression of Factor VII From cDNA 
Clones. 

In order to express Factor VII cDNA containing 

5 a Factor Vll leader, DNA from XV112463 or XVII565 
and XVI 12463 was cloned into an expression vector 
containing the Ad2 major late promoter, SV40 en- 
hancer sequences, the Ad2 tripartite leader, a 
splice set and the SV40 por/adenylation signal. 

10 This vector was -adapted so that it contains a 
unique EcoRI sequence as the site of cDNA inser- 
tion. The expression of sequences from XVII2463, 
which encodes a 60 amino acid leader, and from 
XVII565 and XVII2463, which lacks the codons for 

is amino acids from -18 to -39 and thus encodes a 
leader 38 amino adds in length, were evaluated. 
Because the structure of the Factor Vll lead©- has 
only been identified by cDNA cloning; and because 
of the ambiguity generated by having obtained two 

20 different 5* -terminal cDNAs, the inventors also 
constructed a genomic-cDNA Factor VII sequence. 
The 3 f portion of XV1I2463 (from the Bgl 11 site in 
exon 2, to the EcoRI site Hnkered 3' to the poly(A) 
tail) was adjoined to a subgenomic fragment of 

25 clone 7m1, that encoding exons 1a, 1b and the 
remainder of exon 2. This subgenomic fragment 
reconstructed as an EcoRI-Bglll 4.4 Kb fragment 
was adjoined to the XVII2463 cDNA and cloned into 
a mammalian expression vector. 

30 Briefly, in order to construct the subclones, the 
Factor Vll cDNA EcoRI fragment of XV112463 was 
cloned into the EcoRI site of pUC18, and des- 
ignated pVI!2463. Similarly, the EcoRI cDNA insert 
of XV1I565 was subcloned into pUCl8, and des- 

35 ignated pVII565. A hybrid between the 5* portion of 
the Factor Vll sequence of clone pVII565 and the 3 1 
segment of Factor Vll DNA of pVH2463 was con- 
structed by cloning the S'-most EcoRI-Bgl II Factor 
VII fragment of pVII565 and the Bgl ll-Hind 111 

40 Factor Vll fragment (Hind III site in polytinker of 
PV1I2463) of pVll2463 into pUC 18 digested with 
EcoRI and Hind III. This construct was designated 
pVH2397. The inserts of pVlI2463 and pVK2397 
were removed by EcoRI digestion and gel purified 

45 for insertion into mammalian expression vectors as 
described below. 

A. Expression of full-length Factor Vll cDNA. 

so The expression of Factor Vll was achieved in 
the vector pDX. This vector was derived from pD3 
(described in Example 8 above) and pD3\ a vector 
identical to pD3 except that the SV40 polyadenyla- 
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tion signal (i.e. the SV40 Bam HI [2533 bp] to Bell 
[2770bp] fragment) is in the late orientation. Thus, 
pD3' contains a Bam HI site as the site of gene 
insertion. 

To generate pDX, the EcoRI site in pD3' was 
converted to a Bel I site by Eco Rl cleavage, 
incubation with S1 nuclease, and subsequent liga- 
tion with Bel I linkers. DNA was prepared from a 
positively identified colony, and the 1.9 kb Xhol- 
Pstl fragment containing the altered restriction site 
was prepared via agarose gel electrophoresis. In a 
second modification, Bel l-cleaved pD3 was iigated 
with kinased Eco Rl-Bcl I adaptors (constructed 
from oligonucleotides ZC 525, 5 ' GGAATTCT 3 '; and 
ZC526, 5 *GATCAGAATTCC 3 ) in order to generate 
an Eco Rl site as the position for inserting a gene 
into the expression vector. Positive colonies were 
identified by restriction endonuclease analysis, and 
DNA from this was used to isolate a 2.3 kb Xhol- 
Pstl fragment containing the modifed restriction 
site. The two above-described DNA fragments were 
incubated together with T4 DNA ligase, trans- 
formed into E. coh HB101 and positive colonies 
were identified by restriction analysis. A prepara- 
tion of such DNA, termed pDX, was made (Figure 
12). This DNA was cleaved with Eco Rl and subse- 
quently incubated with calf-intestinal phosphatase. 
The purified DNA was then incubated with T4 DNA 
ligase and the Factor VII Eco Rl fragment from 
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PVII2463, or with the Factor VII Eco Rl cDNA 
fragment derived from pVII2397. The resultant 
clones were designated FVII(2463)/pDX and FVII- 
(565 + 2463)/pDX, respectively (Figure 12). After 
transformation into E. coli JM83 and subsequent 
identification by restriction enzyme analysis, plas- 
mid DNA preparations were made and checked by 
extensive restriction endonuclease digestion. The 
plasmids FVII(2463)/pDX and FVII(555 + 2463)- 
/pDX have been deposited with American Type 
Culture Collection and have been assigned acces- 
sion numbers 40206; and 40205, respectively. 

FVII(2463)/pDX and FVII(565 + 2463)/pDX (10 
ug each) were each transfected, along with 10 ug 
salmon sperm carrier DNA, into either BHK tkt13 
cells (Floros et ah. Exper. Cell Res. 132 : 215-223, 
1981) or COS cells, using standard calcium-phos- 
phate precipitation. Following transfection, the cells 
were cultured in the appropriate media containing 5 
ug/ml vitamin K for two days. At this time, the 
supernatants were assayed for ELISA-positive ma- 
terial, using a monoclonal antibody directed against 
Factor VII. Both FVIi(2463)/pDX and FVII - 
(565 + 2463)/pDX directed the production of Factor 
VII polypeptide which was detected in COS cell 
supernatants, and Factor VII from FV1I(565 + 2463)- 
pDX was detected in BHK cell supernatant Sham- 
transfected BHK cells or COS cells did not yield 
detectable levels of Factor VII (Table 5). 
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TABLE 5 



ELISA Positive 



DNA 


Cell 
Line 


Cell 
Number 


Material (ng/ml 
Culture Medium) 


FVII ( 2463 )/pDX 


COS 


2 x 10 6 


15 


FVII ( 565+2463 )/pDX 


COS 


2 x 10 6 


12 


Control 


COS 


2 x 10 6 


<2 


FVII ( 2463 )/pDX 


BHK 


9 x 10 6 


62 


FVII (565+2463 )/pDX 


BHK 


9 x 10 6 


6 


Control 


BHK 


9 x 10 6 


<2 



Transient expression of Factor VII and afso 
tested in several other cell lines, listed in Table 6. 
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Name 

1. Rat Hep I 

2. Rat Hep II 

3. TCMK 

4. Human lung 

5. Human 
hepatoma 

6. Hep G2 

7. Mouse liver 

8. COS 

9. BHK 

10. 293 

11. DUKX 



TABLE 6 
Description 

Rat hepatoma H4-IIr-E-C3 

Rat hepatoma H4-II-E 

Mouse Kidney , SV40 virus 
transformed, TCMK-1 

SV40 virus transformed 
WI-38 VA13, subline 2RA 

Human liver adenocarcinoma 
SK-HEP-1 

Human hepatoma/ dev. by 
Barbara Knowles/Wistar 
Institute 

NCTC clone 1469 

SV40 -transformed CV-1 
(monkey) cells 

Baby hamster kidney 
BHK-21 (C-13) 

Human embryonic 
Kidney/ Ad transformed 

CH0-DHFR sens 



Reference 
(ATTC #) 

CRL 1600 

CRL 1548 

CCL 139 

CCL 75.1 

HTB-52 

HTB 8065 

CC 29.1 
CRL 1650 

CCL 10 

CRL 1573 

(Urlaub & 
Chasin, 
PNAS (USA) 
77: 4216- 
4220, 1980) 
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Cells were cotransfected with 10 ug of either 
FVU(2463)/pDX or FVU(565 + 2463)/pDX together 
with 1 ug of a plasmid comprising the chloram- 
phenicol acetyl transacetylase gene (to permit 



identification of cotransfected cells) and 10 ug of 
salmon sperm DNA. Mock transfected ceils were 
used as controls. Spent media were assayed by 
EUSA after six days. Results are given in Table 7. 
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' TABLE 7 










_ 


Cell 










Number 


ELISA 


Sample 


Cell Line 


Plasmid 


x 10" 6 


( ng/ml ) 


1. 


Rat Hep I 


Mock 


11.6 


2.4 


2. 


Rat Hep I 


FVIK 565+2463 )/pDX 


7.0 


2 


3. 


Rat Hep I 


FVIK 2463 )/pDX 


7.0 


<2 


4. 


Rat Hep 2 


Mock 


13.0 


<2 


5. 


Rat Hep 2 • 


PVII (565+2463 )/pDX 


18.4 


<2 


. 6. 


Rat Hep 2 


FVII(2463)/pDX 


14.6 


<2 


7. 


TCMK 


Mock 


18.8 


<2 


8. 


TCMK 


FVIK 565+2463 )/pDX 


9.8 


<2 


9. 


TCMK 


FVIK 2463 )/pDX 


12.8 


<2 


10. 


Human Lung 


Mock 


7.2 


<2 


11. 


Human Lung 


FVIK565+2463)/pDX 


3.4 


16.5 
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12. 


Human Lung 


FVII(2463)/pDX 


3.4 


12.2 


13. 


Human Hepatoma 


Mock 


13.0 


<2 


14. 


Human Hepatoma 


FVII (565+2463 )/pDX 


11.0 


3.5 


15. 


Human Hepatoma 


FVII(2463)/pDX 


6.0 


3.0 


16. 


HepG2 


Mock 


6.8 


21 


17 . 


HepG2 


FVII (565+2463 )/pDX 


6.0 


45.5 


18. 


HepG2 


FVII (2463 )/pDX 


6.0 


28 


19. 


Mouse Liver 


Mock 


3.8 


<2 


20. 


Mouse Liver 


FVII (565+2463 )/pDX 


4.0 


<2 


21. 


Mouse Liver 


FVII(2463)/pDX 


3.6 


<2 


22. 


COS 


Mock 


5.6 


<2 


23. 


COS 


FVII ( 565+2463 ) /pDX 


5.6 


15.5 


24. 


COS 


FVII ( 2463 )/pDX 


4.4 


14.5 


25. 


BHK tk"tl3 


Mock 

r 


3.0 


<2 


26. 


BHK tk~tl3 


FVII (565+2463) /pDX 


5.0 


25 


27. 


BKH tk~tl3~ 


FVII (2463) /pDX 


4.0 


22.5 


28 . 


293 


Mock 


5.8 


<2 


29. 


293 


FVII ( 565+2463 )/dDX 


6.2 


94 


30 . 


293 


FVII (2463 )/pDX 


8 .2 


100 


31. 


DUKX 


Mock 


11.6 


<2 


32. 


DUKX 


FVII ( 565+2463 )/pDX 


13.0 


<2 


33. 


DUKX 


FVII ( 2463 )/pDX 


13.6 


<2 



FVII(2463)/pDX (10 ug) or FVII(565 + 2463)/pDX 
(10 ug) was co-transfected with 10 ug of salmon 
sperm DNA and 1 ug of a plasmid encoding the 
resistant form of dihydrofolate reductase (Simonsen 
and Levinson, Proc. Natl. Acad. Sci. USA 80 : 
2495-2499, 1983) in a mammalian .expression vec- 
tor, into BHK tk*t13 cells. After two days, the cells 
were split 1:14 and placed into selective media 
containing either 250 nM or 1000 nM methotrexate 
(MTX) and 5 ug/ml vitamin K (phytadione, Merck). 
After two weeks, colonies were isolated and grown 
to 50-90% confluency. The supernatant media 
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were then assayed for Factor VII polypeptide by 
ELISA. Of the 25 positive clones, 22 were analyzed 
further. The cells were platQd at 5 x 10* (Group I) 
or 1 x 10 s (Group II) in 10 cm dishes containing 5 
ug/ml vitamin K, and either 250 nM or 1000 nM 
methotrexate. Five days later, the faster growing 
clones (designated by an asterisk in Table 8) were 
spirt 1:2, then after 24 hours, the media were 
changed on ail plates. Twenty-three hours (Group 
I) or 20 hours (Group II) later, supernatant media 
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were harvested and cell counts were taken for each 
clone. The media were assayed both by ELISA and 
by the one-stage clotting assay. Results are shown 
in Table 8. 

5 



TABLE 8 
* GROUP I - 23-hour assay 



Clone 


Cell 
Count 

Plasinid X10" 5 


ELISA 

cell/ 
day) 


ELI SA 

(ng/ 
ml) 


Clot- 
ting 
(ng/ 
ml) 


% 

Active 


B4— AJ." 


FVIK 565+2463 )/pDX 


2 


O • 3 


i ^n 




IJO 


B4-B1* 


FVIK 565+2463 )/pDX 


27 


1.9 


513 


360 


70 


B4-C1 


FVIK 565+2463 )/pDX 


16 


2.5 


393 


480 


122 


B4-C2* 


FVIK 565+2463 )/pDX 


9 


<0.2 


<20 


21 




B4-C3 


FVIK 565+2,463 )/pDX 


52 


1.5 


800 


910 


114 


B4-D1* 


FVIK 565+2463 )/pDX 


27 


2.0 


553 


570 


103 


B4-D2* 


FVIK 565+2463 )/pDX 


13 


1.2 


150 


154 


103 


B4-E1 


• 

FVII ( 565+2463 ) /pDX 


39 


2.2 


870 


1160 


133 


B4-E2* 


FVII ( 565+2463 ) /pDX 


8 


2.5 


205 


240 


. 117 


B4-E3 


FVIK 565+2463 ) /pDX 


23 


1.2 


275 


320 


116 


B4-E4 


FVIK 565+2463 )/pDX 


31 


1.3 


410 


300 


73 


B3-5.3* 


FVIK 2463 )/pDX 


5 


8.2 


410 


290 


70 




GROUP II 


- 20-hour assay 






Clone 


i 
i 

Plasmid 


Cell 
Count 

xicr 5 


ELISA 

<pg/ 
cell/ 
day) 


ELISA 
(ng/ 
ml) 


Clot- 
ting 
(ng/ 
ml) 


% 

Active 


B3-2.2 


FVIK 2463 )/pDX 


41 


2.5 


1043 


500 


48 


B3-2.3 


FVIK 2463 )/pDX 


19 


3.0 


580 


610 


105 


B3-3.2 


FVIK 2463 )/pDX 


13 


1.5 


197 


216 


110 


B3-4.2 


FVIK 2463 )/pDX 


41 


1.8 


760 


620 


82 


B3-5.1 


FVIK 2463 )/pDX 


14 


3.3 


460 


400 


87 


B3-5.2 


FVIK2463)/pDX 


9 


2.7 


257 


184 


72 


B6-D 


FVIK 2463 )/pDX 


54 


3.0 


1700 


780 


46 


B6-E 


FVIK 2463 )/pDX 


101 


1.6 


1640 


970 


59 


B6-G 


FVIK 2463 )/pDX 


31 


5.7 


1853 


1080 


58 


B6-M 


FVIK 2463 )/pDX 


75 


2.2 


1743 


940 


54 
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B. Expression of Factor VIII genomic-cDNA hybrid. 

An expression vector containing genomic se- 
quences representing the Factor VII genomic 5* - 
terminus and cDNA sequences from the Factor VII 
gene 3'-terminus was prepared as follows. Three 
subclones of the genomic plasmid 7 ml were used 
to reconstruct the S'-terminus: 7Bam f *7SD and 
7SE. Plasmid 7Bam is a 3.6 Kb EcoRI-Bam HI 
fragment containing exon 1a, subcloned into 
PUC12. A 0.7 Kb EcoRl-Xbai fragment, which con- 
tains exon 1a, was isolated from this subclone and 
is designated fragment a. Plasmid 7SD is a 3.7 Kb 
Sstl fragment containing exon 1b, subcloned into 
PUC18. An exon 1b-containing 3.1 Kb Xbal-Ssti 
fragment was isolated from this subclone and is 
designated fragment b. Plasmid 7SE is a 3.9 Kb 
Sstl fragment containing exons 2-4, subcloned into 
M13mp 19. An Ssti-Bgl II (0.6 Kb) fragment con- 
taining the 5' part of exon 2 was gel isolated and is 
designated fragment c. The remainder of the 3- 
Factor VII cDNA (fragment d) was obtained as a 2 
Kb Bgi U-EcoRI fragment from pUVI!2463. Frag- 
ments a-d were Rgated with EcoRI-cIeaved and calf 
intestinal-phosphatased pDX, and then transformed 
into E. coli JM83 or HB101. Positive colonies were 
identified by restriction endonuclease analysis, and 
plasmid DNA was prepared from these colonies. 

For expression Factor VII, the plasmid DNA is 
co-transfected into BHK or COS cells as described 
above. Transfected cells are cultured in vitamin K- 
containing medium for 2 days, and the medium is 
assayed for Factor VII by ELISA. 

From the foregoing it will be appreciated that, 
although specific embodiments of the invention 
have been described herein for purposes of illustra- 
tion, various modifications may be made without 
deviating from the spirit and scope of the invention. 
Accordingly, the invention is not to be limited ex- 
cept by the appended claims. 

The various strains of E. Coli used in the 
foregoing Examples were the personal choice of 
the inventors. Persons skilled in this art will appre- 
ciate that other suitable strains of E. Coli could be 
substituted (for example if the person already has 
in his possession samples of other suitable E. Coli 
strains). 

Transformants ATCC 53067 and 53068 were 
deposited with the American Type Culture Collec- 
tion on 28 March 1985 and plasmids ATCC 40205 
and 40206 were deposited with the American Type 
Culture Collection on 25 November 1985, all four 
deposits being made in accordance with the Bu- 
dapest Treaty. 



The features disclosed in the foregoing de- 
scription, in the following claims and/or in the ac- 
companying drawings may, both separately and in 
s any combination thereof, be material for realising 
the invention in diverse forms thereof. 

Claims 

10 

1. A DNA construct containing a nucleotide se- 
quence encoding a protein which upon activation 
has the* same or substantially the same biological 
activity for blood coagulation as Factor Vila 

75 

2. The DNA construct of claim 1 wherein said 
nucleotide sequence codes at least partially for 
Factor VII, said nucleotide sequence comprising a 
first nucleotide sequence encoding a calcium bind- 

20 ing domain, joined to a second nucleotide se- 
quence positioned downstream of said first se- 
quence, said second sequence encoding a cata- 
lytic domain for the serine protease activity of 
Factor Vila, the joined sequences coding for a 

25 protein which upon activiation has the same or 
substantially the same biological activity for blood 
coagulation as Factor Vila. 

3. The DNA construct of claim 2 wherein said first 
30 nucleotide sequence is substantially that of a gene 

selected from the group consisting of genes encod- 
ing Factor VII, Factor IX, Factor X, Protein C, 
prothrombin, and Protein S. 

35 4. The DNA construct of claim 2 wherein said first 
nucleotide sequence also encodes a leader pep- 
tide. 

5. The DNA construct of claim 2 wherein said first 
40 nucleotide sequence includes a synthesized 

double-stranded oligonucleotide. 

6. The DNA construct of claim 5 wherein said 
synthesized double-stranded oligonucleotide codes 

45 for substantially the amino-terminal portion of Fac- 
tor VII. 

7. The DNA construct of claim 1 wherein said 
nucleotide sequence encodes Factor VII. 

50 

8. The DNA construct of claim 7 wherein at least a 
portion of said nucleotide sequence is derived from 
a cDNA clone or a genomic clone of Factor VII. 

55 9. The DNA construct of claim 7 comprising a first 
nucleotide sequence derived from a cDNA or a 
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genomic clone of Factor VII, joined to a second 
nucleotide sequence positioned downstream of 
said first sequence, said second sequence derived 
from a cDNA clone of Factor VII, the joined se- 
quences coding for a protein which upon activation 5 
has the same or substantially the same biological 
activity for blood coagulation as Factor Vila. 

10. The DNA construct of claim 7 wherein said 
nucleotide sequence comprises the cDNA se- 10 
quence of Figure 1b, from bp 36 to bp 1433. 

11. The DNA construct of claim 7 wherein said 
nucleotide sequence comprises the cDNA se- 
quence of Figure 1b f from bp 36 to bp 99, followed is 
downstream by the sequence from bp 166 to bp 
1433. 

12. A recombinant plasmid capable of integration in 
mammalian host cell DNA, said piasmid including a 20 
promoter followed downstream by a nucleotide se- 
quence according to any of the previous claims 1- 

1 1 , said nucleotide sequence being followed down- 
stream by a polyadenylation signal. 

25 

13. Mammalian cells transfected with a recom- 
binant plasmid according to claim 12. 

14. A method for producing a protein having bio- 
logical activity for blood coagulation mediated by 30 
Factor Vila, comprising: 

establishing a mammalian host cell which contains 
a DNA construct according to any of the previous 
claims 1-11; 35 

growing said mammalian host cell in an appropriate 
medium; 



isolating the protein product encoded by said DNA 
construct produced by said mammalian host cell; 
and 

activating said protein product to generate a protein 
which has the same or substantially the same 
biological activity for blood coagulation as Factor 
Vila. 

15. The method of claim 14, including amplification 
of the DNA construct by cotransfection of the host 
cell with a gene encoding dihydrofolate reductase, 
wherein the appropriate medium comprises 
methotrexate. 

16. The method of claim 14 wherein said protein 
product is activated by reacting the protein with a 
proteolytic enzyme selected from the group con- 
sisting of Factor Xlla, Factor IXa, kallikrein, Factor 
Xa, and thrombin. 

17. A pharmaceutical preparation for the treatment 
of bleeding disorders containing a protein prepared 
according to claim 14. 

18. A method of producing a protein having biologi- 
cal activity for blood coagulation mediated by Fac- 
tor Vila, which process comprises growing in an 
appropriate medium an established mammalian 
host cell which contains a DNA construct in accor- 
dance with any one of Claims 1 to 1 1 , isolating the 
protein product encoded by said DNA construct 
produced by said mammalian host cell and activat- 
ing said protein product to generate a protein 
which has the same or substantially the same 
biological activity for blood coagulation as Factor 
Vila. 
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FIG. 1A 



EcoRJa 24 39 54 

CAATTCC GG TGC AGG ACG AAG CTG TTC TGG ATT TCT TAC AGT GAT GGG GAC CAG 
Arg Thr Lys Leu Phe Trp He Ser Tyr. Ser Asp Gly Asp Gin 

69 to 99 

TGT GCC TCA AGT CCA TGC CAG AAT GGG GGC TCC TGC AAG GAC CAG CTC CAG TCC 
Cys Ala Ser Ser Pro Cys Gin Asn Gly Gly Ser Cys Lys Asp Gin Leu Gin Ser 

114 129 144 159 

TAT ATC TGC TTC TGC CTC CCT GCC TTC GAG GGC CGG AAC TGT GAG ACG CAC AAG 
Tyr lie Cys Phe Cys Leu Pro Ala Phe Glu Gly Arg Asn Cys GIu Thr His Lys 

174 189 204 Psc la 

GAT GAC CAG CTG ATC TGT GTG AAC GAG AAC GGC GGC TGT GAG CAG TA C TGC AGT 
Asp Asp Gin Leu lie Cys Val Asn Glu Asn GJy Gly Cys Glu Gin Tyr Cys Ser 

219 234 249 264 

GAC CAC ACG GGC ACC AAG CGC TCC TGT CGG TGC CAC GAG GGG TAC TCT CTG CTG 

Asp His Thr Gly Thr Lys Arg Ser Cys Arg Cys His Glu Gly Tyr Ser Leu Leu 

279 294 309 324 

GCA GAC GGG GTG TCC TGC ACA CCC ACA GTT GAA TAT CCA TGT GGA AAA ATA CCT 
Ala Asp Gly Val Ser Cys Thr Pro Thr Val Glu Tyr Pro Cys Gly. Lys lie Pro 

Xba I 339 354 369 

AT T CTA CAA AAA AGA AAT GCC ACC AAA CCC CAA GGC CTSA ATT GTG GGG GGC AAG 
lie Leu Glu Lys Arg Asn Ala Ser Lys Pro Gin Gly Arg Me Val Gly Gly Lys 

384 399 414 429 

GTG TGC CCC AAA GGG GAG TGT CCA TGG CAG GTC CTG TTG TTG GTG AAT GGA GCT 
Val Cys Pro Lys Gly Glu Cys Pro Trp Gin Val Leu Leu Leu Val Asn Gly Ala 

444 459 474 

CAG TTG TGT GGG GGG ACC CTG ATC AAC ACC ATC TGG GTG GTC TCC GCG GCC CAC 
Gin Leu Cys Gly Gly Thr Leu lie Asn Thr lie Trp Val Val Ser Ala Ala His 

489 504 519 534 

TGT TTC GAC AAA ATC AAG AAC TGG AGG AAC CTG ATC GCG GTG CTG GGC GAG CAC 

Cys Phe Asp Lys He Lys Asn Trp Arg Asn Leu lie Ala Val Leu Gly Glu His 

549 564 579 594 

GAC CTC AGC GAG CAC GAC GGG GAT GAG CAG AGC CGG CGG GTG GCG CAG GTC ATC 
Asp Leu Ser Glu His Asp Gly Asp Glu Gin Ser Arg Arg Val Ala Gin Val lie 

609 Sma I 624 639 

ATC CCC AGC ACG TAC GT C CC6 GG C ACC ACC AAC CAC GAC ATC GCG CTG CTC CGC 
He Pro Ser Thr Tyr Val Pro Gly Thr Thr Asn His Asp lie Ala Leu Leu Arg 

654 669 684 699 

CTG CAC CAG CCC GTG GTC CTC ACT GAC CAT GTG GTG CCC CTC TGC CTG CCC GAA 
Leu His Gin Pro Val Val Leu Thr Asp His Val Val Pro Leu Cys Leu Pro Glu 
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714 729 744 

CGG ACC TTC TCT GAG AGG ACG CTG GCC TTC GTG CGC TTC TCA TTG GTC AGC GGC 
Arg Thr Phe Ser Glu Arg Thr Leu Ala Phe Val Arg Phe Ser Leu Val Ser Gly 

759 774 Nar I 789 804 

TOO GGC CAG CTG CTG GAC CGT GGC GCC ACG GCC CTG GAG CTC ATG GTC CTC AAC 

Trp Gly Gin Leu Leu Asp Arg Gly Ala Thr Ala Leu Glu leu Met Val .Leu Asn 

819 834 Pst lb 849 86d 

GTG CCC CGG CTG ATG ACC CAG GAC TGC CTG CAG CAG TCA CGG AAC GTG GGA GAC 
Val Pro Arg Leu Net Thr Gin Asp Cys Leu Gin Gin Ser Arg Lys Val Gly Asp 

879 894 909 

TCC CCA AAT ATC ACG GAG TAC ATG TTC TGT GCC GGC TAC TCG GAT GGC AGC AAG 
Ser Pro Asn lie Thr Glu Tyr Met Phe Cys Ala Gly Tyr Ser Asp Gly Ser Lys 

924 939 954 969 

GAC TCC TGC AAG GGG GAC ACT GGA GGC CCA CAT GCC ACC CAC TAC CGG GGC ACG 
Asp Ser Cys Lys Gly Asp Ser Gly Gly Pro His Ala Thr His Tyr Arg Gly Thr 

984 999 1014 

TGG TAC CTG ACG GGC ATC GTC AGC TGG GGC CAG GGC TGC GCA ACC GTG GGC CAC 
Trp Tyr Leu Thr Gly Me Val Ser Trp Gly Gin Gly Cys Ala Thr Val Gly His 

1029 1044 1059 TaqI 1074 

TTT GGG GTG TAC ACC AGG GTC TCC CAG TAC ATC GAG TGG CTG CAA AAG CTC ATG 
Phe Gly Val Tyr Thr Arg Val Ser Gin Tyr lie Glu Trp Leu Gin Lys Leu Met 

1089 1104 1119 1138 

CGC TCA GAG CCA CGC CCA GGA GTC CTC CTG CGA GCC CCA TTT CCC TAG CCCAGCAGCC 
Arg Ser Glu Pro Arg Pro Gly Val Leu Leu Arg Ala Pro Phe Pro 

Pstlc 

1148 1158 1168 1178 1 188 1198 1208 

CTGGCCTGTG GAGAGAAAGC CAAGGCTGCG TCGAACTGTC CTGGCACCAA ATCCCATATA TTCTTCTGCA 



1218 1228 1238 1248 1258 1268 1278 

GJTAATGGGG TAGAGGAGGG CATGGGAGGG AGGGAGAGGT GGGGAGGGAG ACAGAGACAG AAACAGAGAG 



1288 1298 1308 1318 1328 1338 1348 

AGACAGAGAC AGAGAGAGAC TGAGGGAGAG ACTCTGAGGA CCATGGACAG AGACTCAAAG AGACTCCAAG 



1358 1368 1378 1388 1398 1408 1418 

ATTCAAAGAG ACTAATAGAG ACACAGAGAT GGAATAGAAA AGATGAGAGG CAGAGGCAGA CAGGCGCTGG 



1428 1438 1448 1458 1468 1478 1488 

ACAGAGGGGC AGGGGAGTGC CAAGGTTGTC CTGGAGGCAG ACAGCCCAGC TGAGCCTCCT TACCTCCCTT 
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1*98 1508 1518 1528 1538 15*8 1558 

CAGCCAAGCC CCACCTCCAC GTGATCTGCT GGCCCTCAGG CTGCTGCTCT GCCTTCATTG CTGGAGACAG 

1568 1578 1588 1598 1608 161 8 1628 

TAGAGGCATG ACACACATGG ATGCACACAC ACACACGCCA TGCACACACA CAGAGATATG CACACACACG 

1638 1648 1658 1668 1678 1688 I698 

GATGCACACA CAGATGGTCA CACAGAGTAC CCAAACACAC CGATGCACAC GCACATAGAG ATATGCACAC 

1708 1718 1728 . 1738 17*8 1758 1768 

ACAGATGCAC ACACAGATAT ACACATGCA6 TGCACGCACA TGCCAATGCA CGCACACATC AGTCCACACG 

1 77 8 1788 1798 1808 1818 I828 1838 

GATGCACAGA GATATGCACA CACCGAT6TG CGCACACACA GATAT6CACA CACATGGAT6 ACCACACACA 

181,8 1858 1868 I878 1888 I898 1908 

CACCAAGTGC GCACACACAC CGATGTACAC ACAGATGCAC ACACAGATGC ACACACACC6 ATGCTGACTC 

l 9 i8 1928 1938 19*8 1958 1968 1978 

CATGTGTGCT GTCCTCTGAA 6GCGGTTGTT TAGCTCTCAC TTTTCTGGTT CTTATCCATT ATCATCTTCA 

1988 1998 2008 2018 2028 2038 20*8 

CTTCAGACAA TTCAGAAGCA TCACCATGCA TGGTGGCGAA T6CCCCCAAA CTCTCCCCCA AATGTATTTC 

2058 2068 2078 2088 2098 2108 2118 

TCCCTTCGCT GGGTGCCGGG CTGCACAGAC TATTCCCCAC CTGCTTCCCA GCTTCACAAT AAACGGCTGC 



2128 2138 21*8 2158 2168 EcoRIb 

GTCTCCTC6C AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAGGAATTC 
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FIG. IB -«o 

MetValSerGlnAlaLeuArgLeuLeu 
TCAACAGGCAGGGGCAGCACTGCAGAGATTTCATCATGGTCTCCCAGGCCCTCAGGCTCCTC 
10 20 30 40 50 60 

-50 -40 1 

CysLeuLeuLeuGlyLeuGlnGlyCysLeuAlaAlaGlyGlyValAlaLysAlaSerGlyGly 
TGCCrrCTGCTTGGGCTTCAGGGCTGCCTGGCTGCAGGCGGGGTCGCTAAGGCCTCAGGAGGA 
70 80 90 100 110 120 

-30 -20 \ 

GluThrArgAspMetProTrpLysProGlyProHisArgValPheValThrGlnGluGlu 
GAAACACGGGACATGCCGTGGAAGCCGGGGCCTCACAGAGTCTTCGTAACCCAGGAGGAA 
130 140 150 160 170 180 

-10 -1 +1 +10 

AlaHisGlyValLeuHisArgArgArg ArgAlaAsnAlaPheLeuGluGluLeuArgPro 
GCCCACGGCGTCCTGCACCGGCGCCGGCGCGCCAACGCGTTCCTGGAGGAGCTGCGGCCG 
190 200 210 220 230 240 

+20 +30 
GlySerLeuGluArgGluCysLysGluGluGlnCysSerPheGluGluAlaArgGluIle 
GGCTCCCTGGAGAGGGAGTGCAAGGAGGAGCAGTGCTCCTTCGAGGAGGCCCGGGAGATC 
250 260 270 280 290 300 

+40 +50 
PheLysAspAlaGluArgThrLysLeuPheTrpIleSerTyrSerAspGlyAspGlnCys 
TT.CAAGGACGCGGAGAGGACGAAGCTGTTCTGGATTTCTTACAGTGATGGGGACCAGTGT 
310 320 330 340 350 360 

+60 +70 
AlaSerSerProCysGlnAsnGlyGlySerCysLysAspGlnLeuGlnSerTyrlleCys 
GCCTCAAGTCCATGCCAGAATGGGGGCTCCTGCAAGGACCAGCTCCAGTCCTATATCTGCT 
370 380 390 400 410 420 

+80 +90 
PheCysLeuProAlaPheGluGlyArgAsnCysGluThrHisLysAspAspGlnLeuIle 
TTCTGCCTCCCTGCCTTCGAGGGCCGGAACTGTGAGACGCACAAGGATGACCAGCTGATC 
430 440 450 460 470 480 

+100 +110 
CysValAsnGluAsnGlyGlyCysGluGlnTyrCysSerAspHisThrGlyThrLysArg 
TGTGTGAACGAGAACGGCGGCTGTGAGCAGTACTGCAGTGACCACACGGGCACCAAGCGC 
490 500 510 520 530 540 

+120 +130 
SerCysArgCysHisGluGlyTyr SerLeuLeuAlaAspGlyValSerCysThrProThr 
TCCTGTCGGTGCCACGAGGGGTACTCTCTGCTGGCAGACGGGGTGTCCTGCACACCCACA 
550 560 570 580 590 600 

+140 +150 
ValGluTyrProCysGlyLysIleProIleLeuGluLysArgAsnAlaSerLysProGln 
GTTGAATATCCATGTGGAAAAATACCTATTCTAGAAAAAAGAAATGCCAGCAAACCCCAA 
610 620 630 640 650 660 
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+160 +170 
GlyArglleValGlyGlyLysValCysProLysGlyGluCysProTrpGlnValLeuLeu 
GGCCGAATTGTGGGGGGCAAGGTGTGCCCCAAAGGGGAGTGTCCATGGCAGGTCCTGTTG 
670 680 690 700 . 710 720 

+180 +190 
LeuValAsnGlyAlaGlnLeuCysGlyGlyThrLeuIleAsnThrlleTrpValValSer 
TTGGTGAATGGAGCTCAGTTGTGTGGGGGGACCCTGATCAACACCATCTGGGTGGTCTCC 
730 740 750 760 770 780 

+200 +210 
AlaAlaHisCysPheAspLysIleLysAsnTrpArgAsnLeuIleAlaValLeuGlyGlu 
GCGGCCCACTGTTTCGACAAAATCAAGAACTGGAGGAACCTGATCGCGGTGCTGGGCGAG 
790 880 810 820 830 840 

+220 +230 
HisAspLeuSerGluHisAspGlyAspGluGlnSerArgArgValAlaGlnValTlelle 
CACGACCTCAGCGAGCACGACGGGGATGAGCAGAGCCGGCGGGTGGCGCAGGTCATCATC 
850 860 870 880 890 900 

+240 +250 
ProSerThrTyrValProGlyThrThrAsnHisAspIleAlaLeuLeuArgLeuHisGln 
CCCAGCACGTACGTCCCGGGCACCACCAACCACGACATCGCGCTGCTCCGCCTGCACCAG 
910 920 930 940 950 960 

+260 +270 
ProValValLeuThrAspHisValValProLeuCysLeuProGluArgThrPheSerGlu 
CCCGTGGTCCTCACPGACCATGTGGTGCCCCrCTGCCTGCCCGAACGGACGTTCTCTGAG 
970 980 990 1000 1010 1020 

+280 +290 
ArgThrLeuAlaPheValArgPheSerLeuValSerGlyTrpGlyGlnLeuLeuAspArg 
AGGACGCTGGCCTTCGTGCGCrTCTCATTGGTCAGCGGCTGGGGCCAGCTGCTGGACCGT 
1030 1040 1050 1060 1070 1080 

+300 +310 
GlyAlaThrAlaLeuGluLeuMetValLeuAsnValProArgLeuMetThrGlnAspCys 
GGCGCCACGGCCCTGGAGCTCATGGTCCTCAACGTGCCCCGGCTGATGACCCAGGACTGC 
1090 1100 1110 1120 1130 1140 

+320 +330 
LeuGlnGlnSerArgLysValGlyAspSerProAsnlleThrGluTyrMet PheCysAla 
CTGCAGCAGTCACGGAAGGTGGGAGACTCCCCAAATATCACGGAGTACATGTTCTGTGCC 
1150 1160 1170 1180 1190 1200 

+340 +350 
GlyTyrSerAspGlySerLysAspSerCysLysGlyAspSerGlyGlyProHisAlaThr 
GGCTACTCGGATGGCAGCAAGGACTCCTGCAAGGGGGACAGTGGAGGCCCACATGCCACC 
1210 1220 1230 1240 1250 1260 

+360 +370 
HisTyr ArgGlyThrTrpTyrLeuThrGlylleValSerTrpGlyGlnGlyCysAlaThr 
CACTACCGGGGCACGTGGTACCTGACGGGCATCGTCAGCTGGGGCCAGGGCTGCGCAACC 
1270 1280 1290 1300 1310 1320 
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+380 +390 
ValGlyHisPheGlyValTyrThrArgValSerGlnTyrlleGluTrpLeuGlnLysLeu 
GTGGGCCACTTTGGGGTGTACACCAGGGTCTCCCAGTACATCGAGTGGCTGCAAAAGCTC 
1330 1340 1350 1360 1370 1380 

+400 +406 

MetArgSerGluProArgProGlyValLeuLeuArgAlaProPhePro*** 

ATGCGCTCAGAGCCACGCCCAGGAGTCCTCCTGCGAGCCCCATTTCCCTAGCCCAGCAGC 
1390 1400 1410 1420 1430 1440 

CCTGGCCTGTGGAGAGAAAGCCAAGGCTGCGTCGAACTGTCCTGGCACCAAATCCCATAT 
1450 1460 1470 1480 1490 1500 

ATTCTTCTGCAGTTAATGGGGTAGAGGAGGGCATGGGAGGGAGGGAGAGGTGGGGAGGGA 
1510 1520 1530 1540 1550 1560 

GACAGAGACAGAAACAGAGAGAGACAGAGACAGAGAGAGACTGAGGGAGAGACTCTGAGG 
1570 1580 1590 1600 1610 1620 

ACATGGAGAGAGACTCAAAGAGACTCCAAGATTCAAAGAGACTAATAGAGACACAGAGAT 
1630 1640 1650 1660 1670 1680 

GGAATAGAAAAGATGAGAGGCAGAGGCAGACAGGCGCTGGACAGAGGGGCAGGGGAGTGC 
1690 1700 1710 1720 1730 1740 

CAAGGTTGTCCTGGAGGCAGACAGCCCAGCTGAGCCTCCTTACCTCCCTTCAGCCAAGCC 
1750 1760 1770 1780 1790 1800 

CCACCTGCACGTGATCTGCTGGCCCTCAGGCTGCTGCTCTGCCTTCATTGCTGGAGACAG 
1810 1820 1830 1840 1850 1860 

TAGAGGCATGAACACACATGGATGCACACACACACACGCCAATGCACACACACAGAGATA 
1870 1880 1890 1900 1910 ~ 1920 

TGCACACACACGGATGCACACACAGATGGTCACACAGAGATACGCAAACACACCGATGCA 
1930 1940 1950 1960 1970 1980 

CACGCACATAGAGATATGCACACACAGATGCACACACAGATATACACATGGATGCACGCA 
1990 2000 2010 2020 2030 2040 

CATGCCAATGCACGCACACATCAGTGCACACGGATGCACAGAGATATGCACACACCGATG 
2050 2060 2070 2080 2090 2100 

TGCGCACACACAGATATGCACACACATGGATGAGCACACACACACCAAGTGCGCACACAC 
2110 2120 2130 2140 2150 2160 

ACCGATGTACACACACAGATGCACACACAGATGCACACACACCGATGCTGACTCCATGTG 
2170 2180 2190 2200 2210 2220 

TGCTGTCCTCTGAAGGCGGTTGTTTAGCTCTCACTTTTCTGGTTCTTATCCATTATCATC 
2230 2240 2250 2260 2270 2280 

TTCACTTCAGACAATTCAGAAGCATCACCATGCATGGTGGCGAATGCCCCCAAACTCTCC 
2290 2300 2310 2320 2330 2340 

CCCAAATGTATTTCTCCCTTCGCTGGGTGCCGGGCTGCACAGACTATTCCCCACCTGCTT 
2350 2360 2370 2380 2390 2400 
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» 111 l»l — ■— I I I » - — I »■« I II. I I II I MU M I I ■ II 

CCCAGCTTCACAATAAACGGCTGCGTCTCCTCC6CACACCTGTGGT6CCTGCCACCCAAA 
2410 2420 2430 2240 2450 2460 

AAAAAAAAAAAAAAAAAA 
2470 2480 
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FIG. 6 




FIG.8 



0 200 421 



FIG. 7 

GGATCC ATG CAG CGC GTG AAC ATG ATC ATG GCA GAA TCA CCA GGC 
MET Gin Arg Val Asn MET lie MET Ala Glu Ser Pro Gly 

66 81 
CTC ATC ACC ATC TGC CTT TTA GGA TAT CTA CTC AGT GCT GAA TGT 
Leu lie Thr lie Cys Leu Leu Gly Tyr Leu Leu Ser Ala Glu Cys 

96 111 126 

ACA GTT TTT CTT GAT CAT GAA AAC GCC AAC AAA ATT CTG AAT CGG 
Thr Val Phe Leu Asp His Glu Asn Ala Asn Lys lie Leu Asn Arg 

141 156 171 

CCA AAG AGG TAT AAT TCA GGT AAA TTG GAA GAG TTT GTT CAA GGG 
Pro Lys Arg Tyr Asn Ser Gly Lys Leu Glu Glu Phe Val Gin Gly 

186 201 216 

AAC CTT GAG AGA GAA TGT ATG GAA GAA AAG TGT AGT TTT GAA GAA 
Asn Leu Glu Arg Glu Cys MET Glu Glu Lys Cys Ser Phe Glu Glu 

231 246 261 

GCA CGA GAA GTT TTT GAA AAC ACT GAA AGA ACA AAG CTG. TTC TGG 
Ala Arg Glu Val Phe Glu Asn Thr Glu Arg Thr Lys Leu Phe Trp 

276 291 306 

ATT TCT TAC AGT GAT GGG GAC CAG TGT GCC TCA AGT CCA TGC CAG 
lie Ser Tyr Ser Asp Gly Asp Gin Cys Ala Ser Ser Pro Cys Gin 

321 336 351 

AAT GGG GGC TCC TGC AAG GAC CAG CTC CAG TCC TAT ATC TGC TTC 
Asn Gly Gly Ser Cys Lys Asp Gin Leu Gin Ser Tyr lie Cys Phe 

366 381 396 

TGC CTC CCT GCC TTC GAG GGC CGG AAC TGT GAG ACG CAC AAG GAT 
Cys Leu Pro Ala Phe Glu Gly Arg Asn Cys Glu Thr His Lys Asp 

411 426 441 

GAC CAG CTG ATC TGT GTG AAC GAG AAC GGC GGC TGT GAG CAG TAC 
Asp Glu Leu lie Cys Val Asn Glu Asn Gly Gly Cys Glu Gin Tyr 

456 471 486 

TGC AGT GAC CAC ACG GGC ACC AAG CGC TCC TGT CGG TGC CAC GAG 
Cys Ser Asp His Thr Gly Thr Lys Arg Ser Cys Arg Cys His Glu 

501 516 531 

GGG TAC TCT CTG CTG GCA GAC GGG GTG TCC TGC ACA CCC ACA GTT 
Gly Tyr Ser Leu Leu Ala Asp Gly Val Ser Cys Thr Pro Thr Val 

546 561 576 

GAA TAT CCA TCT GGA AAA ATA CCT ATT CTA GAA AAA AGA AAT GCC 
Glu Tyr Pro Cys Gly Lys lie Pro He Leu Glu Lys Arg Asn Ala 

591 606 621 

AGC AAA CCC CAA GGC CGA ATT GTG GGG GGC AAG GTG TGC CCC AAA 
Ser Lys Pro Gin Gly Arg He Val Gly Gly Lys Val Cys Pro Lys 
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636 651 666 

GGG GAG TGT CCA TGG CAG GTC CTG TT6 TTG GTG AAT GGA GCT CAG 
Gly Glu Cys Pro Trp Gin Val Leu Leu Leu Val Asn Gly Ala Gin 

681 696 711 

TTG TGT GGG GGG ACC CTG ATC AAC ACC ATC TGG GTG GTC TCC GCG 
Leu Cys Gly Gly Thr Leu He Asn Thr He Trp Val Val Ser Ala 

726 741 756 

GCC CAC TGT TTC GAC AAA ATC AAG AAC TGG AGG AAC CTG ATC GCG 
Ala His Cys Phe Asp Lys He Lys Asn Trp Arg Asn Leu He Ala 

771 786 801 

GTG CTG GGC GAG CAC GAC CTC AGC GAG CAC GAC GGG GAT GAG CAG 
Val Leu Gly Glu His Asp Leu Ser Glu His Asp Gly Asp Glu Gin 

816 831 846 

AGC CGG CGG GTG GCG CAG GTC ATC ATC CCC AGC ACG TAC GTC CCG 
Ser Arg Arg Val Ala Gin Val He He Pro Ser Thr Tyr Val Pro 

861 876 891 

GGC ACC ACC AAC CAC GAC ATC GCG CTG CTC CGC CTG CAC CAG CCC 
Gly Thr Thr Asn His Asp He Ala Leu Leu Arg Leu His Gin Pro 

906 921 936 

GTG GTC CTC ACT GAC CAT GTG GTG CCC CTC TGC CTG CCC GAA CGG 
Val Val Leu Thr Asp His Val Val Pro Leu Cys Leu Pro Glu Arg 

951 966 981 

ACG TTC TCT GAG AGG ACG CTG GCC TTC GTG CGC TTC TCA TTG GTC 
Thr Phe Ser Glu Arg Thr Leu Ala Phe Val Arg Phe Ser Leu Val 

996 1011 1026 

AGC GGC TGG GGC CAG CTG CTG GAC CGT GGC GCC ACG GCC CTG GAG 
Ser Gly Trp Gly Gin Leu Leu Asp Arg Gly Ala Thr Ala Leu Glu 

1041 1056 1071 

CTC ATG GTC CTC AAC GTG CCC CGG CTG ATG ACC CAG GAC TGC CTG 
Leu MET Val Leu Asn Val Pro Arg Leu MET Thr Gin Asp Cys Leu 

1086 H01 1H6 

CAG CAG TCA CGG AAG GTG GGA GAC TCC CCA AAT ATC ACG GAG TAC 
Gin Gin Ser Arg Lys Val Gly Asp Ser Pro Asn He Thr Glu Tyr 

1131 1146 H61 

ATG TTC TGT GCC GGC TAC TCG GAT GGC AGC AAG GAC TCC TGC AAG 
MET Phe Cys Ala Gly Tyr Ser Asp Gly Ser Lys Asp Ser Cys Lys 

1176 H91 ' 1206 

GGG GAC AGT GGA GGC CCA CAT GCC ACC CAC TAC CGG GGC ACG TGG 
Gly Asp Ser Gly Gly Pro His Ala Thr His Tyr Arg Gly Thr Trp 

1221 1236 1251 

TAC CTG ACG GGC ATC GTC AGC TGG GGC CAG GGC TGC GCA ACC GTG 
Tyr Leu Thr gly He Val Ser Trp Gly Gin Gly C ys Ala Thr Val 
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1266 1281 1296 

GGC CAC TTT 6GG GTG TAC ACC AGG GTC TCC CAG TAC ATC GAG TGG 
Gly his Phe Gly Val Tyr Thr Arg Val Ser Gin Tyr lie Glu Trp 

1311 1326 1341 

CTG CAA AAG CTC ATG CGC TCA GAG CCA CGC CCA GGA GTC CTC CTG 
Leu Gin Lys Leu MET Arg Ser Glu Pro Arg Pro Gly Val Leu Leu 



1356 

CGA GCC CCA TTT CCC TAG 
Arg Ala Pro Phe Pro 



1378 1388 1398 

CCCAGCAGCC CTGGCCTGTG GAGAGAAAGC 



1408 1418 1428 1438 1448 

CAAGGCTGCG TCGAACTGTC CTGGCACCAA ATCCCATATA TTCTTCTGCA 



1458 1468 1478 1488 1498 

GTTAATGGGG TAGAGGAGGG CATGGGAGGG AGGGAGAGGT GGGGAGGGAG 



1508 1518 1528 1538 1548 

ACAGAGACAG AAACAGAGAG AGACAGAGAC AGAGAGAGAC TGAGGGAGAG 



1558 1568 1578 1588 1598 

ACTCTGAGGA CCATGGAGAG AGACTCAAAG AGACTCCAAG ATTCAAAGAG 



1608 1618 1628 1638 1648 

ACTAATAGAG ACACAGAGAT GGAATAGAAA AGATGAGAGG CAGAGGCAGA 



1658 1668 1678 16.88 1698 

CAGGCGCTGG ACAGAGGGGC AGGGGAGTGC CAAGGTTGTC CTGGAGGCAG 



1708 1718 1728 1738 1748 

ACAGCCCAGC TGAGCCTCCT TACCTCCCTT CAGCCAAGCC CCACCTGCAC 



1758 1768 1778 1788 1798 

GTGATCTGCT GGCCCTCAGG CTGCTGCTCT GCCTTCATTG CTGGAGACAG 



1808 1818 1828 1838 1848 

TAGAGGCATG ACACACATGG ATGCACACAC ACACACGCCA TGCACACACA 



1858 1868 1878 1888 1898 

CAGAGATATG CACACACACG GATGCACACA CAGATGGTCA CACAGAGTAC 



1908 1918 1928 1938 1948 

GCAAACACAC CGATGCACAC GCACATAGAG ATATGCACAC ACAGATGCAC 
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1958 1968 1978 1988 1998 

ACACAGATAT ACACATGGAG TGCACGCACA TGCCAATGCA CGCACACATC 



2008 2018 2028 2038 2048 

AGTGCACACG GATGCACAGA GATATGCACA CACCGATGTG CGCACACACA 



2058 2068 2078 2088 2098 

GATATGCACA CACATGGATG AGCACACACA CACCAAGTGC GCACACACAC 



2108 2118 2128 2138 2148 

CGATGTACAC ACAGATGCAC ACACAGATGC ACACACACCG ATGCTGACTC 



2158 2168 2178 2188 2198 

CATGTGTGCT GTCCTCTGAA GGCGGTTGTT TAGCTCTCAC TTTTCTGGTT 



2208 2218 2228 2238 2248 

CTTATCCATT ATCATCTTCA CTTCAGACAA TTCAGAAGCA TCACCATGCA 



2258 2268 2278 2288 2298 

TGGTGGCGAA TGCCCCCAAA CTCTCCCCCA AATGTATTTC TCCCTTCGC1 



2308 2318 2328 2338 2348 

GGGTGCCGGG CTGCACAGAC TATTCCCCAC CTGCTTCCCA GCTTCACAAT 



2358 2368 2378 2388 2398 

AAACGGCTGC GTCTCCTCGC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



2408 2418 2428 2438 

AAAAAAAAAA AAGGAATTCG AGCTCGGTAC CCGGGGATCC 
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